Gene EcolC_3820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3820 
SymbolulaA 
ID6067199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4175699 
End bp4177096 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content53% 
IMG OID641603232 
ProductPTS system ascorbate-specific transporter subunit IIC 
Protein accessionYP_001726751 
Protein GI170021797 
COG category[S] Function unknown 
COG ID[COG3037] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00675829 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGATCC TCTACAACAT CTTTACCGTG TTTTTTAACC AGGTCATGAC CAATGCCCCG 
TTGTTGCTGG GTATTGTGAC CTGTCTGGGC TACATCCTAC TGCGCAAAAG TGTCAGCGTT
ATTATTAAAG GCACGATTAA AACCATAATT GGTTTCATGT TGTTGCAGGC AGGGTCCGGA
ATCCTCACCA GCACCTTCAA ACCGGTGGTG GCGAAAATGT CCGAAGTCTA CGGCATTAAC
GGCGCAATTT CCGATACTTA CGCGTCGATG ATGGCAACCA TCGACCGCAT GGGCGATGCC
TATAGCTGGG TAGGGTACGC GGTGCTGCTG GCACTGGCGC TGAACATTTG TTACGTGCTG
ATGCGTCGTA TTACCGGTAT TCGCACCATC ATGCTGACCG GCCACATTAT GTTTCAGCAG
GCGGGGCTGA TTGCCGTCAC GCTGTTCATC TTTGGCTACT CCATGTGGAC GACCATTATC
TGCACGGCGA TTCTGGTTTC GCTCTACTGG GGTATTACCT CCAACATGAT GTACAAGCCG
ACTCAGGAAG TGACGGACGG CTGCGGTTTC TCCATCGGTC ACCAGCAACA GTTTGCATCA
TGGATTGCCT ATAAAGTTGC GCCGTTCCTC GGCAAAAAAG AGGAAAGCGT TGAAGACCTC
AAACTGCCAG GCTGGCTGAA TATTTTCCAC GACAACATCG TCTCCACGGC GATTGTGATG
ACCATCTTCT TTGGTGCCAT TCTGCTCTCC TTCGGTATCG ACACTGTGCA GGCGATGGCA
GGCAAAGTGA ACTGGACGGT GTACATCCTG CAAACTGGTT TCTCCTTCGC GGTGGCGATC
TTCATCATCA CTCAGGGCGT GCGCATGTTT GTGGCGGAAC TCTCTGAAGC ATTTAACGGT
ATCTCCCAGC GCCTGATCCC TGGTGCGGTT CTGGCGATTG ACTGTGCGGC TATCTATAGC
TTCGCGCCGA ACGCCGTGGT TTGGGGCTTT ATGTGGGGCA CCATCGGTCA GCTGATTGCG
GTTGGCATCC TGGTCGCCTG CGGTTCCTCG ATCCTGATTA TTCCCGGCTT TATCCCGATG
TTCTTCTCCA ACGCCACCAT CGGCGTGTTC GCTAACCACT TCGGCGGCTG GCGTGCGGCG
CTGAAGATAT GTCTGGTGAT GGGGATGATT GAAATCTTTG GCTGCGTCTG GGCGGTGAAA
CTCACCGGTA TGAGCGCCTG GATGGGCATG GCGGACTGGT CGATTCTGGC ACCGCCGATG
ATGCAGGGCT TCTTCTCCAT CGGTATCGCC TTTATGGCCG TCATCATTGT AATTGCACTG
GCTTATATGT TCTTCGCTGG CCGCGCACTG CGCGCAGAAG AAGATGCAGA AAAACAACTG
GCAGAACAGT CTGCTTAA
 
Protein sequence
MEILYNIFTV FFNQVMTNAP LLLGIVTCLG YILLRKSVSV IIKGTIKTII GFMLLQAGSG 
ILTSTFKPVV AKMSEVYGIN GAISDTYASM MATIDRMGDA YSWVGYAVLL ALALNICYVL
MRRITGIRTI MLTGHIMFQQ AGLIAVTLFI FGYSMWTTII CTAILVSLYW GITSNMMYKP
TQEVTDGCGF SIGHQQQFAS WIAYKVAPFL GKKEESVEDL KLPGWLNIFH DNIVSTAIVM
TIFFGAILLS FGIDTVQAMA GKVNWTVYIL QTGFSFAVAI FIITQGVRMF VAELSEAFNG
ISQRLIPGAV LAIDCAAIYS FAPNAVVWGF MWGTIGQLIA VGILVACGSS ILIIPGFIPM
FFSNATIGVF ANHFGGWRAA LKICLVMGMI EIFGCVWAVK LTGMSAWMGM ADWSILAPPM
MQGFFSIGIA FMAVIIVIAL AYMFFAGRAL RAEEDAEKQL AEQSA