Gene EcolC_3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3137 
Symbol 
ID6066412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3436864 
End bp3438084 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content52% 
IMG OID641602553 
Productmajor facilitator transporter 
Protein accessionYP_001726087 
Protein GI170021133 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.546423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000426482 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAATGA GTGAACAACC CCAGCCTGTG GCGGGCGCGG CTGCGTCAAC GACCAAGGCC 
CGAACATCGT TTGGTATTTT AGGTGCTATC AGCCTCTCAC ATCTGCTGAA CGACATGATC
CAATCGCTGA TTCTGGCGAT TTATCCGCTG CTTCAGTCAG AATTTTCTCT GACATTTATG
CAGATTGGCA TGATAACCCT CACCTTCCAG CTCGCCTCTT CGCTACTGCA ACCAGTGGTC
GGCTACTGGA CCGATAAATA TCCGATGCCG TGGTCGTTGC CAATTGGCAT GTGCTTTACC
TTAAGTGGTC TGGTGCTGCT TGCGCTGGCG GGCAGTTTTG GCGCAGTTCT GCTGGCGGCG
GCGCTGGTCG GTACCGGTTC ATCGGTCTTT CATCCGGAAT CTTCTCGCGT GGCCCGTATG
GCTTCCGGCG GGCGGCATGG CCTGGCACAA TCTATCTTTC AGGTCGGCGG CAACTTTGGC
AGTTCCCTGG GACCCTTGCT GGCGGCGGTG ATTATCGCGC CTTATGGCAA AGGCAACGTT
GCCTGGTTTG TGCTTGCGGC ACTGCTGGCG ATCGTGGTGT TGGCACAAAT CAGCCGTTGG
TACTCGGCAC AGCACCGAAT GAATAAAGGA AAACCCAAAG CGACGATAAT CAATCCACTG
CCGCGCAACA AAGTGGTACT GGCAGTCAGC ATTCTGTTAA TCCTCATTTT CTCGAAATAT
TTCTATATGG CGAGCATCAG CAGCTATTAC ACCTTTTATC TGATGCAAAA ATTCGGATTA
TCTATCCAGA ATGCCCAGCT TCATCTGTTT GCCTTCCTGT TTGCCGTTGC GGCAGGTACG
GTGATCGGCG GGCCTGTAGG GGATAAAATT GGACGGAAAT ATGTGATTTG GGGCTCTATC
CTCGGCGTTG CGCCGTTTAC GCTGATTTTA CCCTACGCCA GCCTGCACTG GACGGGGGTT
TTAACGGTGA TTATTGGATT TATCCTCGCT TCGGCATTCT CTGCCATTCT GGTCTACGCT
CAGGAGCTAC TTCCGGGACG TATCGGTATG GTTTCTGGAC TCTTTTTCGG TTTTGCTTTT
GGCATGGGAG GTCTGGGAGC GGCAGTTCTG GGGCTTATCG CCGATCACAC CAGCATCGAG
TTAGTCTATA AAATCTGTGC TTTCCTGCCA CTATTGGGGA TGTTGACCAT ATTCCTGCCT
GATAACCGGC ATAAAGACTG A
 
Protein sequence
MAMSEQPQPV AGAAASTTKA RTSFGILGAI SLSHLLNDMI QSLILAIYPL LQSEFSLTFM 
QIGMITLTFQ LASSLLQPVV GYWTDKYPMP WSLPIGMCFT LSGLVLLALA GSFGAVLLAA
ALVGTGSSVF HPESSRVARM ASGGRHGLAQ SIFQVGGNFG SSLGPLLAAV IIAPYGKGNV
AWFVLAALLA IVVLAQISRW YSAQHRMNKG KPKATIINPL PRNKVVLAVS ILLILIFSKY
FYMASISSYY TFYLMQKFGL SIQNAQLHLF AFLFAVAAGT VIGGPVGDKI GRKYVIWGSI
LGVAPFTLIL PYASLHWTGV LTVIIGFILA SAFSAILVYA QELLPGRIGM VSGLFFGFAF
GMGGLGAAVL GLIADHTSIE LVYKICAFLP LLGMLTIFLP DNRHKD