Gene EcolC_3787 details


Gene Information

Locus tag: EcolC_3787
Symbol:
ID: 6066462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4145845
End bp: 4147578
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 53%
IMG OID: 641603200
Product: surface antigen (D15)
Protein accession: YP_001726719
Protein GI: 170021765
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0729] Outer membrane protein
TIGRFAM ID:
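
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is an untranslated stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4145845, 4147578)
print(length)                     # 1734
print(protein_length_aa(length))  # 577
```

Both results match the Gene Length and Protein Length fields of the record.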


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.718556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCTATA TCCGACAGTT ATGCTGTGTA AGCTTACTCT GCTTAAGCGG ATCTGCCGTC 
GCCGCGAACG TCCGTCTACA GGTCGAGGGG TTATCGGGAC AGCTGGAAAA GAACGTTCGT
GCGCAGCTTT CTACGATTGA AAGTGATGAA GTGACGCCAG ACCGTCGCTT TCGCGCACGC
GTCGATGATG CCATCCGCGA AGGTCTGAAA GCGCTGGGTT ATTACCAGCC GACCATTGAA
TTTGATCTCC GTCCACCGCC AAAGAAAGGG CGGCAGGTAT TGATCGCCAA AGTCACGCCA
GGCGTGCCGG TGTTAATTGG CGGCACCGAT GTGGTATTGC GCGGCGGCGC GCGGACCGAT
AAAGACTATT TGAAATTGCT CGATACTCGC CCGGCTATTG GCACGGTGCT GAACCAGGGC
GATTATGAAA ATTTCAAAAA GTCCTTAACC AGCATTGCGT TGCGTAAAGG TTATTTCGAT
AGCGAATTTA CCAAAGCGCA GCTGGGCATT GCGCTCGGCC TGCATAAAGC CTTCTGGGAT
ATTGATTATA ACAGTGGCGA ACGTTACCGC TTTGGGCATG TGACCTTTGA AGGATCACAA
ATCCGCGATG AATACCTGCA AAATCTGGTG CCGTTTAAAG AGGGCGATGA GTACGAATCG
AAAGATCTGG CGGAACTGAA CCGCCGACTT TCTGCTACCG GCTGGTTTAA CTCGGTGGTG
GTGGCTCCAC AATTTGATAA AGCGCGCGAA ACGAAAGTAT TACCATTGAC GGGCGTGGTT
TCGCCGCGAA CAGAAAACAC CATCGAAACC GGGGTCGGTT ACTCTACGGA CGTGGGACCG
CGCGTGAAAG CGACGTGGAA AAAACCGTGG ATGAACTCAT ACGGTCACAG TCTGACCACC
AGTACCAGTA TTTCCGCGCC GGAACAGATC CTCGACTTCA GCTATAAAAT GCCGCTGCTG
AAGAATCCAC TGGAACAATA TTATTTGGTG CAGGGCGGTT TTAAGCGCAC TGACCTGAAC
GATACCGAAT CTGACTCCAC TACGCTGGTG GCTTCTCGCT ACTGGGATCT CTCCAGCGGC
TGGCAGCGTG CCATTAACCT GCGCTGGAGT CTCGACCACT TTACCCAGGG TGAAATTACC
AACACCACGA TGCTGTTTTA TCCTGGGGTG ATGATTAGCC GCACGCGTTC TCGTGGTGGC
CTGATGCCAA CCTGGGACGA CTCGCAACGC TACTCTATCG ACTACTCCAA CACTGCCTGG
GGCTCAGATG TCGATTTCTC CGTTTTCCAG GCACAAAACG TCTGGATCCG CACACTGTAC
GATCGCCATC GTTTTGTGAC ACGCGGCACG CTGGGCTGGA TTGAAACCGG TGATTTCGAC
AAAGTACCGC CGGATCTGCG TTTTTTCGCC GGGGGCGACC GCAGTATTCG TGGCTACAAA
TACAAATCTA TCGCTCCGAA ATACGCCAAC GGTGACCTGA AAGGGGCCTC GAAGTTGATA
ACCGGATCGC TGGAGTACCA GTACAACGTG ACCGGAAAAT GGTGGGGCGC GGTGTTTGTC
GATAGTGGCG AAGCGGTAAG CGATATTCGC CGCAGCGACT TTAAAACCGG TACCGGGGTC
GGCGTACGCT GGGAATCGCC GGTCGGGCCA ATCAAACTCG ATTTTGCCGT ACCGGTCGCG
GATAAAGACG AACACGGGTT ACAGTTTTAC ATCGGTCTGG GGCCAGAATT ATGA
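
The gene sequence is displayed in space-separated groups of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC percentage. It assumes GC content means (G + C) / total bases over the gene sequence, which the record does not state explicitly; the helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a displayed sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "GTGCGCTATA TCCGACAGTT ATGCTGTGTA AGCTTACTCT GCTTAAGCGG ATCTGCCGTC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

Passing the full sequence block to `clean_sequence` instead of the single line would give a value that can be compared with the GC content field of the record.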
 
Protein sequence
MRYIRQLCCV SLLCLSGSAV AANVRLQVEG LSGQLEKNVR AQLSTIESDE VTPDRRFRAR 
VDDAIREGLK ALGYYQPTIE FDLRPPPKKG RQVLIAKVTP GVPVLIGGTD VVLRGGARTD
KDYLKLLDTR PAIGTVLNQG DYENFKKSLT SIALRKGYFD SEFTKAQLGI ALGLHKAFWD
IDYNSGERYR FGHVTFEGSQ IRDEYLQNLV PFKEGDEYES KDLAELNRRL SATGWFNSVV
VAPQFDKARE TKVLPLTGVV SPRTENTIET GVGYSTDVGP RVKATWKKPW MNSYGHSLTT
STSISAPEQI LDFSYKMPLL KNPLEQYYLV QGGFKRTDLN DTESDSTTLV ASRYWDLSSG
WQRAINLRWS LDHFTQGEIT NTTMLFYPGV MISRTRSRGG LMPTWDDSQR YSIDYSNTAW
GSDVDFSVFQ AQNVWIRTLY DRHRFVTRGT LGWIETGDFD KVPPDLRFFA GGDRSIRGYK
YKSIAPKYAN GDLKGASKLI TGSLEYQYNV TGKWWGAVFV DSGEAVSDIR RSDFKTGTGV
GVRWESPVGP IKLDFAVPVA DKDEHGLQFY IGLGPEL