Gene EcolC_2975 details


Gene Information

Locus tag: EcolC_2975
Symbol:
ID: 6065746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3246565
End bp: 3247971
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 51%
IMG OID: 641602385
Product: outer membrane porin
Protein accession: YP_001725927
Protein GI: 170020973
COG category:
COG ID:
TIGRFAM ID:
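
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(3246565, 3247971)
print(length_bp)                           # 1407
print(protein_length_from_cds(length_bp))  # 468
```

Both results agree with the Gene Length and Protein Length fields in this record.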


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.162497
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTACGT TTAGTGGCAA ACGTAGTACG CTGGCGCTGG CTATCGCCGG TGTTACAGCA 
ATGTCGGGCT TTATGGCAAT GCCGGAGGCT CGCGCCGAAG GATTCATCGA CGATTCAACC
TTAACCGGCG GTATATATTA CTGGCAGCGT GAACGCGACC GTAAAGATGT TACCGACGGC
GACAAATACA AAACCAACCT TTCTCACTCC ACCTGGAATG CCAACCTCGA TTTTCAGTCT
GGTTATGCTG CTGATATGTT CGGCCTTGAT ATTGCCGCGT TTACGGCGAT TGAAATGGCG
GAAAACGGCG ACAGCTCTCA CCCGAACGAA ATCGCGTTTT CAAAAAGTAA TAAAGCCTAT
GACGAAGACT GGTCCGGCGA CAAAAGCGGT ATAAGCCTGT ATAAAGCAGC GGCCAAATTT
AAATACGGTC CGGTTTGGGC GAGGGCAGGT TATATTCAGC CAACCGGTCA GACGCTGTTA
GCGCCTCACT GGAGCTTTAT GCCGGGTACT TATCAGGGTG CGGAAGCCGG AGCGAATTTT
GATTACGGCG ATGCCGGTGC GTTGAGTTTC TCCTACATGT GGACCAACGA ATACAAAGCG
CCGTGGCATC TGGAAATGGA TGAGTTTTAT CAGAACGATA AAACCACCAA AGTTGATTAT
CTGCACTCCC TTGGGGCGAA ATACGACTTC AAAAATAACT TCGTACTGGA AGCGGCTTTT
GGTCAGGCGG AAGGGTATAT CGATCAATAT TTTGCCAAAG CCAGCTACAA ATTTGATATC
GCCGGTAGCC CGTTAACCAC CAGCTACCAG TTCTACGGTA CCCGCGATAA AGTTGACGAT
CGCAGCGTCA ACGACCTTTA TGACGGCACC GCCTGGCTGC AGGCGTTGAC CTTTGGTTAC
CGGGCGGCTG ACGTAGTGGA TTTGCGCCTC GAAGGCACCT GGGTTAAGGC TGACGGTCAG
CAGGGATACT TCCTGCAACG TATGACTCCA ACCTACGCTT CCTCAAACGG TCGCCTGGAT
ATCTGGTGGG ACAACCGTTC TGACTTCAAC GCCAACGGCG AAAAAGCAGT CTTCTTCGGT
GCGATGTATG ACCTGAAAAA CTGGAATCTT CCAGGCTTCG CCATCGGCGC TTCCTACGTT
TACGCATGGG ATGCTAAACC TGCGACCTGG CAGAGCAATC CGGATGCGTA CTACGACAAA
AACCGGACTA TTGAAGAGTC TGCATACAGC CTGGATGCGG TCTACACCAT TCAGGACGGT
CGCGCCAAAG GCACGATGTT CAAACTGCAT TTCACCGAAT ACGACAACCA CTCCGACATC
CCAAGCTGGG GCGGTGGTTA CGGCAACATC TTCCAGGATG AGCGTGACGT GAAATTTATG
GTAATCGCAC CATTCACCAT CTTCTGA
 
Protein sequence
MRTFSGKRST LALAIAGVTA MSGFMAMPEA RAEGFIDDST LTGGIYYWQR ERDRKDVTDG 
DKYKTNLSHS TWNANLDFQS GYAADMFGLD IAAFTAIEMA ENGDSSHPNE IAFSKSNKAY
DEDWSGDKSG ISLYKAAAKF KYGPVWARAG YIQPTGQTLL APHWSFMPGT YQGAEAGANF
DYGDAGALSF SYMWTNEYKA PWHLEMDEFY QNDKTTKVDY LHSLGAKYDF KNNFVLEAAF
GQAEGYIDQY FAKASYKFDI AGSPLTTSYQ FYGTRDKVDD RSVNDLYDGT AWLQALTFGY
RAADVVDLRL EGTWVKADGQ QGYFLQRMTP TYASSNGRLD IWWDNRSDFN ANGEKAVFFG
AMYDLKNWNL PGFAIGASYV YAWDAKPATW QSNPDAYYDK NRTIEESAYS LDAVYTIQDG
RAKGTMFKLH FTEYDNHSDI PSWGGGYGNI FQDERDVKFM VIAPFTIF
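
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only: it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in its permitted start codons), and it strips the spaces that separate the 10-base blocks. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base); '*' marks a stop.
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = ("ATGCGTACGT TTAGTGGCAA ACGTAGTACG "
              "CTGGCGCTGG CTATCGCCGG TGTTACAGCA")
print(translate(first_line))  # MRTFSGKRSTLALAIAGVTA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the reported GC content.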