Gene Jann_3804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3804 
Symbol 
ID3936284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3889368 
End bp3890873 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content63% 
IMG OID637906182 
Productextracellular solute-binding protein 
Protein accessionYP_511746 
Protein GI89056295 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000726853 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000406829 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCTCGA TCAAAACAAC CGCCCGCGCG CTGGCACTCA TAAGCACCGC CCTCGCGGCC 
CCATTGTCCG CGCAAACGCT TGATCTTGCA TGGTCCCAGG ACGCCACCGG CCTTGATCCG
CACACGCAGC CCGGCTTCGC GACGATCCGC CTGCTGGAAT TGATGTATGA GCCGCTTTTG
CGTCTGGATG CCAACCTGGA GCTTCAGCCC GCCATCGCGC AAAGCTGGTC CTTCTCCGAC
GATGGTCTGC AACTGACATT CCAACTGGAC CCCGCGGCGA TGTTCCACGA CGGCACATCC
GTGACCTCTG CCGATGTCCG CGCCTCGTTC GAGCGTATTC TGGACGAGGA AACCGGCGCG
ATTTCGCGGG CGAACTACAC GTCCATCATC AATATCGAAA CACCCGATGA TGCCACCGTG
GTGTTTGAAC TGGATCGCCC CGATGCGCCG ATCCTCAACG GTCTGGCCAC GGTGAACGCC
GCCGTCCTGC CTGCGTCGGC GATTGAGGCA GGCACGATCG CCACGGAAGT CGTGGGCTCC
GGCCCGTTCA TGTTGGACGC CCGCACGCCC AACGCCAGCG CCACCCTGAC GAGTTTCGCG
GATTGGCATG GCGGCGACGT GGCCTACGAC ACCCTGTCCA TCAGCGTGCT GCCCGATGAA
ACCGCGCTTC TGGGTGCCTT GCGTGCCGGT CAGGCCGATT TTGCATTGAT CAACGATCCG
TTGGTCGCGA CGCTGGTGCC CTCTACGGAT GGATTGACGC TGAACACCGC GCCCACGCTC
AGCTACTATG TGCTGCAACT CAACGCAGCC CGGGAGCCGA TGGACTCGCT GCCCCTGCGG
CAGGCGATCA GCTGCGCGAT CAACCGCCAG GACATTCTGG ATGCCGCCCT TCTGGGTGAG
GGGGAGGTGA CGGGTCCCCT GACATCCCCC GCCTATCGCA CCGACCCAAG CAGCCTGTTC
TGCTATGAGC AGGATCAGGA TCGCGCCCGC GCGTTGCTGG CAGAGGCGGG CTTTGCCGAT
GGCTTCACGG CCACCGTCAT GGCCGCAACC GGTGAGCCGC CCACCGCATC GGCCGTGGCG
CAGGTGATCC AGTCGCAACT GTCTGAGGTC GGCATCACGC TTGAGATCGA GATGCAGGAG
CTGAGCGTCT ATATCGACCG CTGGCTTGCC GCGGATTTCG ACATGGCCGT GGCGCTGAAC
GGCGGGCGCG TGGACCCCTA TACGATGTAC AACCGCTACT GGACCCGCGA CGGGAACCTG
CAAGGCGTCG CCAACTACAT CGATGATACA CTCGACACGC TGATGAACGA TGGCCGGGCC
GAGACGGGCG AAGAGGCCCG CCGGGAGATC TATGCCAACT TCGAGTCCCA TCTGGCCGAA
ATGTCGCCCT GGGTCTGGCT GTTCACTGGC AACACCTACA CGGCTCAGAC CGACGCGGTC
TCCGGATTCG TTCCCACGCC CAACGGATCG CTCTTCGGCC TCGTGGATGT GACCCTGGCT
GAATAA
 
Protein sequence
MTSIKTTARA LALISTALAA PLSAQTLDLA WSQDATGLDP HTQPGFATIR LLELMYEPLL 
RLDANLELQP AIAQSWSFSD DGLQLTFQLD PAAMFHDGTS VTSADVRASF ERILDEETGA
ISRANYTSII NIETPDDATV VFELDRPDAP ILNGLATVNA AVLPASAIEA GTIATEVVGS
GPFMLDARTP NASATLTSFA DWHGGDVAYD TLSISVLPDE TALLGALRAG QADFALINDP
LVATLVPSTD GLTLNTAPTL SYYVLQLNAA REPMDSLPLR QAISCAINRQ DILDAALLGE
GEVTGPLTSP AYRTDPSSLF CYEQDQDRAR ALLAEAGFAD GFTATVMAAT GEPPTASAVA
QVIQSQLSEV GITLEIEMQE LSVYIDRWLA ADFDMAVALN GGRVDPYTMY NRYWTRDGNL
QGVANYIDDT LDTLMNDGRA ETGEEARREI YANFESHLAE MSPWVWLFTG NTYTAQTDAV
SGFVPTPNGS LFGLVDVTLA E