Gene EcSMS35_2132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2132 
Symbol 
ID6145244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2143267 
End bp2144340 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content49% 
IMG OID641617008 
Product4Fe-4S ferredoxin iron-sulfur binding domain-containing protein 
Protein accessionYP_001744183 
Protein GI170682733 
COG category[C] Energy production and conversion 
COG ID[COG0348] Polyferredoxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGA AAAAAAGAAC CCGCTGGCAG CGGCGGCCAG GCACGACGGG CGGCAAATTA 
CCGTGGAATG ACTGGCGCAA TGCCACGACC TGGCGTAAAG CGACGCAATT ATTACTGTTG
GCAATGAATA TTTATATTGC CATCACGTTC TGGTATTGGG TGCGCTATTA CGAAACGGCA
GGTAGTACGA CATTTGTCGC CAGGCCAGGA GGCATCGAAG GCTGGCTACC GATTGCCGGT
CTGATGAATC TGAAATATAG CCTTGCAACA GGCCAGTTAC CGTCCGTCCA CGCCGCCGCG
ATGCTGTTAT TGGTCGCTTT TATCGTCATC AGTCTATTAC TCAAAAAGGC CTTTTGCTCA
TGGTTATGCC CGGTTGGTAC GCTTTCTGAA TTAATCGGCG ATCTCGGTAA CAAACTGTTT
GGTCGGCAAT GTGTCCTTCC CCGCTGTCTG GATATTCCTC TGCGTGGCGT GAAGTATTTG
CTGTTGAGTT TTTTTCTCTA TATCGCGTTA TTGATGCCCG CTCAGGCGAT TCACTATTTT
ATGTTGTCGC CCTACAGCGT GGTGATGGAC GTTAAAATGC TCGATTTCTT TCGCCATATG
GGGACCGCGA CATTAATCAG CGTGACCGTT TTGCTGATTG CCAGCCTGTT TATTCGCCAT
GCCTGGTGTC GTTATCTTTG CCCATATGGC GCGCTGATGG GCGTGGTTTC GCTATTATCA
CCGTTTAAGA TTCGTCGCAA TGCCGAAAGT TGTATCGACT GTGGCAAATG CGCAAAAAAT
TGCCCATCGC GGATCCCGGT CGATAAATTA ATTCAGGTAC GAACAGTGGA ATGTACCGGC
TGTATGACTT GCGTAGAGTC ATGTCCGGTA GCCTCAACAT TGACCTTTTC ACTGCAAAAA
CCTGCGGCAA ATAAAAAAGC TTTTGCGTTG TCTGGCTGGT TAATGACACT ACTGGTTCTG
GGGATTATGT TTGCGGTGAT TGGTTACGCA ATGTATGCGG GAGTATGGCA AAGCCCGGTA
CCGCAGGAAT TGTACCGACG CATAATTCCA CAAGCGCCAA TGATTGGTCA CTAA
 
Protein sequence
MAEKKRTRWQ RRPGTTGGKL PWNDWRNATT WRKATQLLLL AMNIYIAITF WYWVRYYETA 
GSTTFVARPG GIEGWLPIAG LMNLKYSLAT GQLPSVHAAA MLLLVAFIVI SLLLKKAFCS
WLCPVGTLSE LIGDLGNKLF GRQCVLPRCL DIPLRGVKYL LLSFFLYIAL LMPAQAIHYF
MLSPYSVVMD VKMLDFFRHM GTATLISVTV LLIASLFIRH AWCRYLCPYG ALMGVVSLLS
PFKIRRNAES CIDCGKCAKN CPSRIPVDKL IQVRTVECTG CMTCVESCPV ASTLTFSLQK
PAANKKAFAL SGWLMTLLVL GIMFAVIGYA MYAGVWQSPV PQELYRRIIP QAPMIGH