Gene Nmul_A2653 details


Gene Information

Locus tag: Nmul_A2653
Symbol:
ID: 3785265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 3043677
End bp: 3044684
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 58%
IMG OID: 637812743
Product: riboflavin biosynthesis protein RibF
Protein accession: YP_413332
Protein GI: 82703766
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0196] FAD synthase
TIGRFAM ID: [TIGR00083] riboflavin kinase/FMN adenylyltransferase
            [TIGR00125] cytidyltransferase-related domain
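
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3043677
end_bp = 3044684

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1008 335
```

Both results agree with the Gene Length and Protein Length fields of the record.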


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCATTT CCCACGGCAC TCCCGTTCAG GCTGATGCGC CGGTTGCCCT CACCATCGGC 
AATTTCGATG GGGTGCATCT GGGGCACCAG GCGATGCTTG CGCGCCTGAA AAAGGCAGCT
GACCGGCTTG GTGTCGAATC CTGTGTCATG ATTTTCGAAC CGCATCCGCG TGAATTTTTC
GCACCGGATA AGGCGCCCAC GCGGCTCACC AGCCTGCGGG AGAAGCTGGA GTTGCTGGCG
GCAGCCGGAG TGGAGCGGGT GCAGATATGC CGCTTCGATT TCGATTTTGC CAGAATTCCG
GCGGAGGATT TTATTGTCCG CATCCTTCAG CATGGTCTGG CTGCGCGCTG GATCCTGGTG
GGAGACGACT TTCGCTTCGG TGCACGCCGT GCCGGTGATT ATGAGATGCT CAAGGCATTT
TCGGCAGAGT GCGGTTTCGA GGTGGAGGAC ATGCCGGGCT TTACCGTAAA CGGCCTGCGG
GTTTCAAGCA CGGCGGTTCG CGAAGCATTG GCGGCCGGCG ATCTCGACCT GGTCAAACGC
CTGCTGGGCC GTTTCTACAG CATCAGCGGG CGCGTGGTGG ATGGCGACAA GCTGGGCAAG
AAGATCGGGT TTCCTACCGC GAACATCCAG CTCAAGCATA ATCGCCCGCC GCTGGCGGGA
ATCTTTGCCG TTGAAGTCGA AATGGAAGCT TCGGGTAACG AGACGCCGCC GTCATTGGAA
GTGCCTTCAT TGGATACACT GCGGGGGGTT GCAAGCCTGG GGGTGCGTCC TACCGTGCAT
GAGCATGGCA AGCCGGTGCT GGAGGTTCAC CTGTTCGATT TCGACCAGGA AATTTATGGC
CGCCATTTGC GGGTGCATTT TCTTCACAAG CTGCGTGATG AGGAAAAATA TTCCGGCCTC
GAGGCGCTGA CCAGACAGAT TGGCCGGGAT GTGGACAATG CAAAGAATTA CTTCTCCTCG
TTATCCGCAG CGCCCGCTCT TTCTGAAAAG AAGCTTGTGC AAGGATAA
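
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name, and only the first sequence line is used as sample input, so the printed value is not the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above; paste every line to get the
# whole-gene value for comparison with the GC content field.
first_line = "ATGCGCATTT CCCACGGCAC TCCCGTTCAG GCTGATGCGC CGGTTGCCCT CACCATCGGC"
print(f"{gc_fraction(first_line):.0%}")
```

The function strips the 10-base grouping spaces and line breaks, so the sequence can be pasted exactly as displayed.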
 
Protein sequence
MRISHGTPVQ ADAPVALTIG NFDGVHLGHQ AMLARLKKAA DRLGVESCVM IFEPHPREFF 
APDKAPTRLT SLREKLELLA AAGVERVQIC RFDFDFARIP AEDFIVRILQ HGLAARWILV
GDDFRFGARR AGDYEMLKAF SAECGFEVED MPGFTVNGLR VSSTAVREAL AAGDLDLVKR
LLGRFYSISG RVVDGDKLGK KIGFPTANIQ LKHNRPPLAG IFAVEVEMEA SGNETPPSLE
VPSLDTLRGV ASLGVRPTVH EHGKPVLEVH LFDFDQEIYG RHLRVHFLHK LRDEEKYSGL
EALTRQIGRD VDNAKNYFSS LSAAPALSEK KLVQG
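
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. This is a minimal sketch, assuming the sequence is given in the coding orientation and that translation table 11 assigns the same amino acids to codons as the standard code (it differs mainly in the start codons it allows). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last codons of the gene are used as input.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 18 bases of the gene sequence above.
head = translate("ATGCGCATTT CCCACGGCAC TCCCGTTCAG")
tail = translate("AAGCTTGTGC AAGGATAA")
print(head, tail)  # MRISHGTPVQ KLVQG*
```

The two fragments match the start (MRISHGTPVQ) and end (KLVQG) of the protein sequence, with the final TAA codon read as the stop.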