Gene Nmul_A2547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2547 
Symbol 
ID3786273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2916261 
End bp2917445 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content55% 
IMG OID637812638 
Producthypothetical protein 
Protein accessionYP_413228 
Protein GI82703662 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.374165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGAAAG ATGTCGTCGT AATAGGCGCT GGCGCAGCCG GCATGATGTG CGCAATGGAA 
GCGGGAAAGC GTGGACGAAG TGTGTTGCTG GTGGATCATG CCAGCAAACT GGCGGAAAAA
ATCCGTATCT CGGGAGGGGG GCGGTGTAAT TTCACGAATC GCCACACTGT TCCGGAAAAT
TTTCTGTCGC AGAATCCGCA TTTCTGCCGA TCCGCACTTG CACGTTTCAC TCCTCGCCAC
TTCATAGAGC TGGTGGAAAA GCACCGCATC CGTTATCACG AAAAAAAGCT GGGACAGCTA
TTTTGCGATG AGGCCTCGCA GCAAATCATC GACATGCTGC GCAGTGAATG CGAGGCAGCC
GGAGTCATCT TTCAAATGCC CTGCGAGGTA AGCCGCATAG ACCGGGACTC CGGCAATACT
GGATTCGTAT TGGAAACCAG CTGTAGCAAA GTGATGGCAG ACGCGCTGGT AATCGCGACC
GGAGGTCTTT CCATTCCGCA AATCGGCGCC AGTCCTTTTG GCTATCGCAT CGCAGAACAG
TTCGGCATAA ACGTTACGGC GCTACGTCCC GCCCTGGTAC CGCTGACTTT TGCGCCGGAA
CAGTTATCCG CTTTTTCAGG GCTCACGGGA ATTGCGCTCG ATACAATAGT GAGCTGTAAC
GGCGCGCATT TTAGAGAAAA TCTGCTGATC ACGCATCGGG GTCTGAGCGG GCCTGCAATC
CTCCAGATTT CCTCATACTG GCGGCCGGGA GATCCGATCC ATATCAACCT GTTACCTGAA
CTGGATGCAG ACGATTGGTT GCGCGATCGC AGACACAGCG GGGTCCTGCT ATCCAATCTG
TTAGCGCAGC ATCTGCCCCG GCGGTTTGCA GAGGCCTGGC TGGGTGCAAT GATGGGTGGG
CTCCCGGAAA CACCTGTAAA CCAGTATGGC AACAAGAGCT TGAGGCAACT GGCTCCTCAA
TTGCATGCCT GGCAGGTTAT TCCGAGCGGC ACCGCCGGCT ATAAAAAGGC GGAAGTAACC
CTTGGAGGCA TCGATACTGC CGAGCTTTCT TCCAAAACGA TGGAATCGAA AAAAGTACCC
GGCCTTTATT TCGTGGGAGA AGTCGTCGAT GTCACGGGCC AACTGGGGGG CTTCAATTTC
CAGTGGGCCT GGTCATCGGG TTATGCGGCA GGGCAATCAG TGTAA
 
Protein sequence
MKKDVVVIGA GAAGMMCAME AGKRGRSVLL VDHASKLAEK IRISGGGRCN FTNRHTVPEN 
FLSQNPHFCR SALARFTPRH FIELVEKHRI RYHEKKLGQL FCDEASQQII DMLRSECEAA
GVIFQMPCEV SRIDRDSGNT GFVLETSCSK VMADALVIAT GGLSIPQIGA SPFGYRIAEQ
FGINVTALRP ALVPLTFAPE QLSAFSGLTG IALDTIVSCN GAHFRENLLI THRGLSGPAI
LQISSYWRPG DPIHINLLPE LDADDWLRDR RHSGVLLSNL LAQHLPRRFA EAWLGAMMGG
LPETPVNQYG NKSLRQLAPQ LHAWQVIPSG TAGYKKAEVT LGGIDTAELS SKTMESKKVP
GLYFVGEVVD VTGQLGGFNF QWAWSSGYAA GQSV