Gene EcolC_1679 details

Gene Information

Locus tag: EcolC_1679
Symbol:
ID: 6064939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1881927
End bp: 1883000
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 43%
IMG OID: 641601093
Product: porin
Protein accession: YP_001724658
Protein GI: 170019704
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
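
The length fields above are consistent with the coordinates: with inclusive, 1-based positions (an assumption about how this record counts bases), the gene length is the end position minus the start position plus one, and the protein length is the codon count minus one stop codon. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1881927, 1883000)
print(length, protein_length(length))  # prints: 1074 357
```

Both results match the Gene Length and Protein Length fields of this record.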


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.512961
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.118098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAA AAGTTCTGGC AATGCTGGTC CCGGCGTTAT TAGTTGCTGG CGCAGCAAAT 
GCGGCTGAAA TTTATAATAA AGATGGCAAT AAAGTTGATT TCTACGGCAA GATGGTTGGC
GAACGCATCT GGTCAAATAC TGATGATAAT AACAGCGAAA ACGAAGATAC CTCCTATGCT
CGTTTCGGTG TTAAAGGTGA AACACAAATC ACCAGCGAAC TGACTGGTTT TGGTCAGTTT
GAATACAACC TCGACGCCAG CAAGCCCGAA GGCTCTAATC AGGAAAAAAC TCGTTTAACC
TTCGCAGGTT TGAAATATAA CGAGTTAGGA TCTTTCGACT ATGGTCGTAA CTACGGTGTT
GCTTATGATG CCGCAGCTTA TACCGATATG TTAGTTGAAT GGGGTGGTGA TTCCTGGGCT
TCCGCTGACA ACTTCATGAA CGGTCGTACC AACGGTGTTG CAACCTACCG TAACTCTGAT
TTCTTTGGTC TGGTTGATGG TCTGAATTTT GCTGTGCAAT ATCAAGGTAA GAACAGCAAT
CGTGGCGTTA CTAAACAAAA CGGTGATGGC TATGCGTTGT CTGTAGACTA CAACATCGAA
GGTTTTGGTT TTGTAGGTGC ATATAGCAAA TCTGATCGTA CTAATGAACA AGCTAGTGAC
GGCTACGGTG ATAACGCTGA AGTGTGGTCA TTAGCAGCCA AGTATGATGC AAATAATATC
TATGCAGCAA TGATGTACGG TGAAACCCGC AACATGACCG TTTTGGCTAA TGATCATTTT
GCAAATAAAA CCCAAAACTT TGAAGCTGTT GTACAGTATC AGTTCGACTT CGGTTTACGT
CCGTCTTTAG GCTACGTATA TTCCAAAGGC AAAGATCTTT ATGCTCGTGA TGGACATAAA
GGTGTTGATG CTGACCGCGT AAATTATATC GAAGTTGGTA CCTGGTACTA CTTCAATAAG
AACATGAACG TCTACACAGC ATACAAGTTT AACCTGCTGG ATAAAGACGA TGCAGCGATT
ACCGATGCCG CAACTGATGA CCAGTTTGCA GTTGGTATCG TCTACCAGTT CTAA
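
The record reports a GC content of 43% for this gene. As a rough sketch of how such a figure could be recomputed from the sequence as displayed, the hypothetical helper below strips the block spacing and counts G and C bases. How the database rounds the value is not stated, so the result should be compared with that in mind.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace (block spacing, newlines) is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Paste the full gene sequence here to check the reported figure;
# only the first row is shown to keep the example short.
first_row = "ATGAAAAGAA AAGTTCTGGC AATGCTGGTC CCGGCGTTAT TAGTTGCTGG CGCAGCAAAT"
print(round(gc_content(first_row), 1))
```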
 
Protein sequence
MKRKVLAMLV PALLVAGAAN AAEIYNKDGN KVDFYGKMVG ERIWSNTDDN NSENEDTSYA 
RFGVKGETQI TSELTGFGQF EYNLDASKPE GSNQEKTRLT FAGLKYNELG SFDYGRNYGV
AYDAAAYTDM LVEWGGDSWA SADNFMNGRT NGVATYRNSD FFGLVDGLNF AVQYQGKNSN
RGVTKQNGDG YALSVDYNIE GFGFVGAYSK SDRTNEQASD GYGDNAEVWS LAAKYDANNI
YAAMMYGETR NMTVLANDHF ANKTQNFEAV VQYQFDFGLR PSLGYVYSKG KDLYARDGHK
GVDADRVNYI EVGTWYYFNK NMNVYTAYKF NLLDKDDAAI TDAATDDQFA VGIVYQF
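
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal, dependency-free illustration, not the database's own pipeline. It builds the codon table from the conventional TCAG ordering, relying on the fact that table 11 assigns the same amino acids as the standard code and differs only in the start codons it permits. The names `CODON_TABLE` and `translate` are illustrative, and input containing characters other than A, C, G, T would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code; only the allowed start codons differ.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAAAGAA AAGTTCTGGC AATGCTGGTC"))  # prints: MKRKVLAMLV
```

The output matches the first block of the protein sequence listed above.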