Gene EcolC_1002 details


Gene Information

Locus tag: EcolC_1002
Symbol:
ID: 6067674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1090147
End bp: 1091586
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 53%
IMG OID: 641600410
Product: anaerobic nitric oxide reductase flavorubredoxin
Protein accession: YP_001723998
Protein GI: 170019044
COG category: [C] Energy production and conversion
COG ID: [COG0426] Uncharacterized flavoproteins; [COG1773] Rubredoxin
TIGRFAM ID:
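
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1090147, 1091586)
print(length)                  # 1440
print(protein_length(length))  # 479
```

Both results agree with the Gene Length (1440 bp) and Protein Length (479 aa) fields listed in the record.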


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0118173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000199326
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTATTG TGGTGAAAAA TAACATTCAT TGGGTTGGTC AACGTGACTG GGAAGTGCGT 
GATTTTCACG GTACGGAATA TAAAACGCTG CGCGGCAGCA GTTACAACAG CTACCTCATC
CGCGAAGAAA AAAACGTGCT GATCGACACC GTCGACCATA AATTCAGCCG CGAATTTGTG
CAGAACCTGC GTAATGAAAT CGATCTGGCG GATATCGATT ACATCGTGAT TAACCATGCA
GAAGAGGACC ACGCCGGGGC GCTGACCGAA CTGATGGCAC AAATTCCCGA TACGCCGATC
TACTGTACAG CCAACGCTAT CGACTCGATA AATGGTCATC ACCATCATCC GGAGTGGAAT
TTTAATGTGG TGAAAACTGG CGACACGCTG GATATCGGCA ACGGCAAACA GCTCATTTTT
GTCGAAACAC CAATGCTGCA CTGGCCGGAC AGCATGATGA CTTACCTGAC AGGCGACGCG
GTGCTGTTCA GTAACGATGC TTTCGGTCAA CACTACTGCG ACGAGCATCT GTTCAACGAT
GAAGTGGATC AGACGGAGCT TTTCGAGCAG TGCCAGCGTT ACTACGCCAA TATCCTGACG
CCGTTCAGCC GCCTGGTAAC ACCGAAAATT ACCGAGATCC TGGGCTTTAA CTTACCAGTC
GATATGATAG CCACTTCCCA CGGCGTGGTA TGGCGCGATA ACCCGACGCA AATTGTCGAG
CTGTACCTGA AATGGGCGGC TGATTATCAG GAAGACAGAA TCACCATTTT CTACGACACC
ATGTCGAATA ACACCCGCAT GATGGCTGAC GCTATCGCCC AGGGGATTGC GGAAACCGAC
CCACGCGTGG CGGTGAAAAT TTTCAACGTC GCCCGAAGCG ATAAAAACGA AATCCTGACT
AATGTCTTCC GCTCAAAAGG CGTGCTGGTC GGCACTTCGA CGATGAATAA CGTGATGATG
CCGAAAATCG CCGGGCTGGT GGAGGAGATG ACTGGTTTAC GCTTCCGTAA CAAACGCGCC
AGTGCTTTCG GCTCTCACGG CTGGAGCGGC GGTGCGGTGG ATCGTCTTTC CACGCGCCTG
CAGGATGCGG GTTTCGAAAT GTCGCTTAGC CTGAAAGCGA AATGGCGACC AGACCAGGAC
GCTCTGAAGT TATGCCGTGA ACACGGTCGC GAAATCGCCC GTCAGTGGGC GCTCGCGCCG
CTGCCGCAGA GCACGGTGAA TACGGTAGTT AAAGAAGAAA CCTCTGCCAC CACGACGGCT
GACCTCGGCC CACGGATGCA GTGCAGCGTC TGCCAGTGGA TTTACGATCC GGCAAAAGGC
GAGCCAATGC AGGACGTTGC GCCAGGAACG CCGTGGAGTG AAGTCCCGGA TAACTTCCTC
TGCCCGGAAT GCTCCCTCGG CAAAGACGTC TTTGAAGAAC TGGCATCGGA GGCAAAATGA
 
Protein sequence
MSIVVKNNIH WVGQRDWEVR DFHGTEYKTL RGSSYNSYLI REEKNVLIDT VDHKFSREFV 
QNLRNEIDLA DIDYIVINHA EEDHAGALTE LMAQIPDTPI YCTANAIDSI NGHHHHPEWN
FNVVKTGDTL DIGNGKQLIF VETPMLHWPD SMMTYLTGDA VLFSNDAFGQ HYCDEHLFND
EVDQTELFEQ CQRYYANILT PFSRLVTPKI TEILGFNLPV DMIATSHGVV WRDNPTQIVE
LYLKWAADYQ EDRITIFYDT MSNNTRMMAD AIAQGIAETD PRVAVKIFNV ARSDKNEILT
NVFRSKGVLV GTSTMNNVMM PKIAGLVEEM TGLRFRNKRA SAFGSHGWSG GAVDRLSTRL
QDAGFEMSLS LKAKWRPDQD ALKLCREHGR EIARQWALAP LPQSTVNTVV KEETSATTTA
DLGPRMQCSV CQWIYDPAKG EPMQDVAPGT PWSEVPDNFL CPECSLGKDV FEELASEAK
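
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence might be joined, checked for GC content, and translated. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares; table 11 differs from the standard table in the start codons it permits, and this sketch does not handle alternative start codons.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments.
_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AAS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1; stop codons appear as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence shown above.
head = join_blocks("ATGTCTATTG TGGTGAAAAA TAACATTCAT")
print(translate(head))  # MSIVVKNNIH
```

The translated prefix matches the first block of the protein sequence. Running `gc_fraction` on the full joined gene sequence should give a value comparable to the GC content field in the record.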