Gene EcSMS35_2833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2833 
SymbolnorV 
ID6143778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2907703 
End bp2909142 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content53% 
IMG OID641617702 
Productanaerobic nitric oxide reductase flavorubredoxin 
Protein accessionYP_001744857 
Protein GI170679642 
COG category[C] Energy production and conversion 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1773] Rubredoxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0125184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATTG TGGTGAAAAA TAACATTCAT TGGGTTGGTC AACGTGACTG GGAAGTGCGT 
GATTTTCACG GTACAGAATA TAAAACGCTG CGCGGCAGCA GCTACAACAG CTACCTCATC
CGTGAAGAAA AAAACGTGCT GATCGACACC GTCGACCATA AATTCAGCCG CGAATTTGTG
CAGAACCTGC GTAATGAAAT CGATCTGGCA GATATCGATT ACATCGTAAT TAACCATGCT
GAAGAGGACC ACGCCGGGGC GCTGACCGAA CTGATGGCAC AAATTCCCGA TACGCCGATC
TACTGTACTG CCAACGCTAT CGACTCGATA AATGGTCATC ACCATCATCC GGAGTGGAAT
TTTAATGTGG TGAAAACTGG CGACACGCTG GATATCGGCA ACGGCAAACA GCTCATTTTT
GTCGAAACGC CAATGCTGCA CTGGCCGGAC AGCATGATGA CTTACCTGAC AGGCGACGCG
GTGCTGTTCA GTAACGATGC TTTCGGTCAG CACTACTGCG ACGAGCATCT GTTCAACGAT
GAAGTGGATC AGACGGAGCT TTTCGAGCAG TGCCAGCGTT ACTACGCCAA TATCCTGACG
CCGTTCAGCC GCCTGGTAAC GCCGAAAATT ACCGAGATCC TGGGCTTTAA CTTGCCAGTC
GATATGATAG CGACCTCTCA CGGCGTGGTA TGGCGCGATA ACCCGACGCA AATTGTCGAG
CTGTACCTGA AATGGGCGGC GGATTATCAG GAAGACAGAA TCACCATTTT CTACGACACC
ATGTCGAATA ACACCCGCAT GATGGCTGAC GCTATCGCCC AGGGGATTGC GGAAACCGAC
CCACGCGTGG CGGTGAAAAT TTTCAACGTC GCCCGAAGCG ATAAAAACGA AATCCTGACC
AATGTCTTCC GCTCAAAAGG CATGCTGGTC GGCACCTCTA CGATGAATAA TGTGATGATG
CCGAAAATCG CCGGGCTGGT GGAGGAGATG ACCGGATTAC GCTTCCGTAA CAAACGCGCC
AGCGCTTTCG GCTCTCACGG CTGGAGCGGC GGTGCGGTGG ATCGTCTTTC CACGCGCCTG
CAGGATGCGG GTTTCGAAAT GTCGCTTAGC CTGAAAGCCA AATGGCGACC AGACCAGGAC
GCTCTGGAGT TATGTCGTGA ACACGGTCGC GAAATCGCCC GTCAGTGGGC GCTCGCGCCG
CTGCCGCAGA GCACGGTGAA TACGGTAGTT AAAGAAGAAA CCTCTGCCAC CACGACGGCT
GACCTCGGCC CACGGATGCA ATGCAGCGTC TGCCAGTGGA TTTACGATCC GGCAAAAGGT
GAGCCAATGC AGGACGTTGC GCCAGGAACG CCGTGGAGTG AAGTCCCGGA TAACTTCCTG
TGCCCGGAAT GCTCTCTCGG CAAAGACGTC TTTGACGAAC TGGCATCGGA GGCAAAATGA
 
Protein sequence
MSIVVKNNIH WVGQRDWEVR DFHGTEYKTL RGSSYNSYLI REEKNVLIDT VDHKFSREFV 
QNLRNEIDLA DIDYIVINHA EEDHAGALTE LMAQIPDTPI YCTANAIDSI NGHHHHPEWN
FNVVKTGDTL DIGNGKQLIF VETPMLHWPD SMMTYLTGDA VLFSNDAFGQ HYCDEHLFND
EVDQTELFEQ CQRYYANILT PFSRLVTPKI TEILGFNLPV DMIATSHGVV WRDNPTQIVE
LYLKWAADYQ EDRITIFYDT MSNNTRMMAD AIAQGIAETD PRVAVKIFNV ARSDKNEILT
NVFRSKGMLV GTSTMNNVMM PKIAGLVEEM TGLRFRNKRA SAFGSHGWSG GAVDRLSTRL
QDAGFEMSLS LKAKWRPDQD ALELCREHGR EIARQWALAP LPQSTVNTVV KEETSATTTA
DLGPRMQCSV CQWIYDPAKG EPMQDVAPGT PWSEVPDNFL CPECSLGKDV FDELASEAK