Gene Ent638_3159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3159 
SymbolproX 
ID5111712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3439581 
End bp3440576 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content53% 
IMG OID640493358 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_001177874 
Protein GI146312800 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.138504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACATA ACGTACTTTT TGCCACAGCG TTTGCCACCC TTGTCTCAAC CAGCACATTT 
GCGGCTGATC TCCCGGGCAA AGGCATTACC GTTCAACCGG TTCAGAGCAC CATTTCCGAA
GAGACGTTCC AGACCATGAT CGTCAGCCGT GCGCTGGAAA AACTGGGCTA TACGGTCAAT
AAGCCAAGTG AAGTGGATTA CAACGTGGGC TATACCTCGA TCGCCTCTGG CGACGCCACG
TTCACCGCCG TTAACTGGAA GCCGCTGCAT GATGATATGT ACGCTGCTGC GGGCGGGAGT
AAAAAATTCT ATCGCGAAGG AACATTTGTG ACCGGTGCGG CGCAGGGCTA TCTGATCGAC
AAGAAAACCG CCGAGAAATA CCACATCACC AATATCGAGC AGTTGAAAGA TCCGAAGATC
GCCAAACTGT TCGACACCAA CGGTGACGGT AAAGCCGACA TGATGGGCTG CTCCCCAGGC
TGGGGTTGTG AAGCGGTGAT TAACCATCAG AACAAAGCGT TCGATCTTGA GAAGACCGTT
GACGTGAGCC ACGGGAATTA CTCGGCGATG ATGGCGGATA CTATCGCGCG CTTTAAAGAA
GGCAAACCAG TTATCTATTA CACCTGGACT CCATACTGGG TGAGCGATGT GTTGAAGCCG
GGTAAAGATG TAGTGTGGCT GCAGGTGCCG TTCTCCTCTC TGCCAGGCGA ACAGAAAGAT
ATCGACACCA AGCTGCCGAA CGGCATGAAC TATGGCTTCC CGGTGAATAC GATGCATATC
GTGGCGAACA AAGCCTGGGC AGAGAAAAAC CCGGCGGCGG CGAAACTGTT CTCCGTGATG
AAACTGCCCC TGGCGGATAT CAACGCGCAG AACGCGATGA TGCATGAAGG CAAATCGTCC
GATGCAAATA TTCAGGGTCA CGTTGACGGC TGGATCAAAG CCCACCAGCA GCAGTTTGAT
GGCTGGGTGA AAGAGGCGCT GGCCGCACAG AAATAG
 
Protein sequence
MRHNVLFATA FATLVSTSTF AADLPGKGIT VQPVQSTISE ETFQTMIVSR ALEKLGYTVN 
KPSEVDYNVG YTSIASGDAT FTAVNWKPLH DDMYAAAGGS KKFYREGTFV TGAAQGYLID
KKTAEKYHIT NIEQLKDPKI AKLFDTNGDG KADMMGCSPG WGCEAVINHQ NKAFDLEKTV
DVSHGNYSAM MADTIARFKE GKPVIYYTWT PYWVSDVLKP GKDVVWLQVP FSSLPGEQKD
IDTKLPNGMN YGFPVNTMHI VANKAWAEKN PAAAKLFSVM KLPLADINAQ NAMMHEGKSS
DANIQGHVDG WIKAHQQQFD GWVKEALAAQ K