Gene TM1040_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2471 
Symbol 
ID4076836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2611032 
End bp2612243 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content59% 
IMG OID638007795 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_614465 
Protein GI99082311 
COG category[C] Energy production and conversion 
COG ID[COG1251] NAD(P)H-nitrite reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0311087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCATA TCGCTGTGAT TGGCGCAGGT CAGGCAGGGT CCTCCCTGGT TGCCAAGCTG 
CGCAAATGCG GATTTGACGG TGAGATCACC CTGATCGGTG CCGAGAAGGT GCTCCCGTAT
CAACGCCCTC CGCTGTCCAA GGCCTATTTG CTGGGTGAAA TGGAACTTGA GCGCCTGTTT
CTGCGGCCCG AGAGTTTTTA TGCCGAGAAC AACATCACCC TGCGTCTTGG TACCAAGGTG
GATGCGATCG ACGCGGCTGC AAAGACTTTG CAGATCGGCG ACGAAACGCT GTCCTATGAT
CAGCTTGTGC TCACGACTGG GTCGCACCCA CGTCACCTGC CTGCTGCCAT CGGTGGCGAT
TTGGGCGGTG TGCATGTTGT GCGCGACCTT AAGGATGTAG ACGCGATGGC CCCCGCGGTT
ACGGATGGCG CGCGCGCATT GATTGTCGGC GGCGGTTATA TCGGCCTCGA GGCGGCCGCG
GTCTGCGCCA AACGGGGTGT ATCAGTCACG TTGGTTGAAA TGGCGGACCG CATTTTGCAA
CGAGTCGCCG CGCCAGAGAC TTCCGATTAC TTCCGCGCGC TGCACAGCGC GCAAGGTGTA
GACATCCGCG AGGGCGTGGG CCTGTCGCAT CTGGAGGGAG ACGCAGGCAA GGTGACCTGC
GCGGTGCTGG CGGATGGCAC CCGTCTCGAT GTGGATTTTG TGGTCGTTGG TGTCGGGATC
ACCCCAGCCT CTGAACTGGC CGCAGACGCG GGGCTCGAGA TCGAAAACGG CATTCGCACG
GATGAGCTGG GCCGCACCTC GGATCCGGCA ATCTGGGCCG CAGGCGATTG CGCATCCTTT
CCGTATAAGG GCCAGCGAAT CCGCTTGGAA AGCGTGCCAA ACGCAATTGA TCAGGCCGAA
GTGGTGGCGG AAAATCTCCT AGGGGCAGAA AAAGCCTATG TGGCTACGCC CTGGTTCTGG
TCGGATCAAT ACGATGTGAA ACTGCAAATT GCGGGTCTGA ATTCAGGCTA TGACAATGTT
GTGACGCGTC AGGGCGCAGA TGGCTCCATG TCCTTCTGGT ATTACACCGG GGATCAACTG
GTGGCTGTGG ATGCGATGAA TGATCCACGC GCTTACATGG TAGCAAAACG CTTGATCGAA
GCCGGAAAAA CGGCGGACAA GGCGATTGTG GTCGATCCCG AAGCGGATCT GAAGCCGCTC
CTCAAGGCAT GA
 
Protein sequence
MTHIAVIGAG QAGSSLVAKL RKCGFDGEIT LIGAEKVLPY QRPPLSKAYL LGEMELERLF 
LRPESFYAEN NITLRLGTKV DAIDAAAKTL QIGDETLSYD QLVLTTGSHP RHLPAAIGGD
LGGVHVVRDL KDVDAMAPAV TDGARALIVG GGYIGLEAAA VCAKRGVSVT LVEMADRILQ
RVAAPETSDY FRALHSAQGV DIREGVGLSH LEGDAGKVTC AVLADGTRLD VDFVVVGVGI
TPASELAADA GLEIENGIRT DELGRTSDPA IWAAGDCASF PYKGQRIRLE SVPNAIDQAE
VVAENLLGAE KAYVATPWFW SDQYDVKLQI AGLNSGYDNV VTRQGADGSM SFWYYTGDQL
VAVDAMNDPR AYMVAKRLIE AGKTADKAIV VDPEADLKPL LKA