Gene EcolC_1027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1027 
SymbolproX 
ID6066751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1116046 
End bp1117038 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content53% 
IMG OID641600440 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_001724023 
Protein GI170019069 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.23532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT 
GCTGCCGATC TGCCGGGCAA AGGCATTACT GTTAATCCAG TTCAGAGCAC CATCACTGAA
GAAACCTTCC AGACGCTGCT GGTCAGTCGT GCGCTGGAGA AATTAGGTTA TACCGTCAAC
AAACCCAGCG AAGTAGATTA CAACGTTGGC TACACCTCGC TTGCTTCCGG CGATGCAACC
TTCACCGCCG TGAACTGGAC GCCACTGCAT GACAACATGT ACGAAGCTGC CGGTGGCGAT
AAGAAATTTT ATCGTGAAGG GGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT
AAGAAAACCG CCGACCAGTA CAAAATCACC AACATCGCAC AACTGAAAGA TCCGAAGATC
GCCAAACTGT TCGATACCAA CGGCGACGGA AAAGCGGATT TAACCGGTTG TAACCCTGGC
TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGAACTGAC CCATACCGTG
ACGCATAATC AGGGGAACTA CGCGGCGATG ATGGCCGACA CCATCAGTCG CTACAAAGAG
GGCAAACCGG TGTTTTACTA CACCTGGACG CCGTACTGGG TGAGTAATGA GCTGAAGCCA
GGGAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCCGCAC TGCCGGGCGA TAAAAACGCC
GATACCAAAC TGCCGAATGG TGCGAATTAT GGCTTCCCGG TCAGCACCAT GCATATCGTT
GCCAACAAAG CCTGGGCCGA GAAAAACCCG GCAGCAGCGA AACTGTTTGC CATTATGCAG
TTGCCAGTGG CAGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA
GGCGATATTC AGGGCCATGT TGATGGCTGG ATCAAAGCCC ACCAGCAGCA GTTCGATGGC
TGGGTGAATG AGGCGCTGGC AGCGCAGAAG TAA
 
Protein sequence
MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN 
KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID
KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYELTHTV
THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA
DTKLPNGANY GFPVSTMHIV ANKAWAEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE
GDIQGHVDGW IKAHQQQFDG WVNEALAAQK