Gene Jann_1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1497 
Symbol 
ID3933944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1467824 
End bp1468768 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content63% 
IMG OID637903847 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_509439 
Protein GI89053988 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000845092 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTTT TTCGCGCAAC TGCGACCAGT GCCCTCGCGT TAACCCTGAC CGCCGGGGCC 
GCGATGGCCG ACGCCCATGC AGACTGTGGC ACCGTGACCT TCTCGGACGT CGGTTGGACG
GATATCACCG CCACGACAGC CGCGACCTCC GTCGTGCTGG AGGCTTTGGG CTATGAGACC
GAGATCCTCG TTCTGTCCGT TCCGGTGACC TACACGTCGC TTGCGGAAGG GGATGTGGAT
ATCTTCCTGG GCAACTGGAT GCCGACGATG GAAGCCGACA TCGCGCCCTA TCGTGAGGCG
GGCACCGTCG ATACCGTCCG CGCCAACCTT GAGGGCGCGA AGTACACGCT GGCCACGAAC
GCCGCCGGGG CCGCCCTCGG GATCACTGAT TTCGCCTCCA TCGTGGCGGC CATGGACGAG
CTGGATGGCG AGATCTACGG CATTGAGCCC GGCAACGACG GCAACCGCCT GATCATGGAT
ATGATCGAGG CCGACGCCTT TGGTCTGAGC GAGTTTGAAG TCGTCGAATC CTCCGAGCAG
GGCATGCTGG CGCAGGTCGC CCGCGCCTCT GACCGGGACG AGCCCGTCGT TTTCCTCGGC
TGGGAACCGC ATCCGATGAA TGCCAATTTC GATCTGACCT ATCTGGAAGG CGGCGATGAT
TGGTTCGGCC CCAATCTGGG TGGCGCGACG GTCTTCACCA ACACCTCCGC TGGCTACGCG
GACGCCTGCC CGAACGTCGG CGCGCTTCTG AACAATCTGG AATTCAGCCT CGCCATGGAG
AACGAAATCA TGGGCGCGAT CCTGGACGAG GGCGAAGATC CTGCCGATGC CGCAACGGCC
TGGATGGCCG CCAATCCGGA TGCTGTGATG GCCTGGCTTG ACGGTGTAAC AACCTTCGAC
GGCGGCGACG CATCCGCGGC CGTGTCCGAG GCACTGGGCC TCTAA
 
Protein sequence
MTLFRATATS ALALTLTAGA AMADAHADCG TVTFSDVGWT DITATTAATS VVLEALGYET 
EILVLSVPVT YTSLAEGDVD IFLGNWMPTM EADIAPYREA GTVDTVRANL EGAKYTLATN
AAGAALGITD FASIVAAMDE LDGEIYGIEP GNDGNRLIMD MIEADAFGLS EFEVVESSEQ
GMLAQVARAS DRDEPVVFLG WEPHPMNANF DLTYLEGGDD WFGPNLGGAT VFTNTSAGYA
DACPNVGALL NNLEFSLAME NEIMGAILDE GEDPADAATA WMAANPDAVM AWLDGVTTFD
GGDASAAVSE ALGL