Gene Nmul_A2158 details

Gene Information

Locus tag: Nmul_A2158
Symbol:
ID: 3784398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2452427
End bp: 2453488
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 58%
IMG OID: 637812246
Product: RNA 3'-terminal-phosphate cyclase
Protein accession: YP_412843
Protein GI: 82703277
COG category: [A] RNA processing and modification
COG ID: [COG0430] RNA 3'-terminal phosphate cyclase
TIGRFAM ID: [TIGR03399] RNA 3'-phosphate cyclase
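
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2452427
end_bp = 2453488
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp, codons, expected_protein_length)  # 1062 354 353
```

Under these assumptions the computed values agree with the listed gene length (1062 bp) and protein length (353 aa).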


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.164943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGAAA TCGATGGTTC TTATGGAGAA GGTGGCGGCC AGTTGCTCCG CACGTCGGTT 
GCGCTTGCGG CGATAACGGG ACAATCGGTT CGTGTGTACA ACATTCGTGC AAAACGTTCC
AATCCTGGTC TTGCGCCTCA ACATCTGACT GCCGTAAAGG CGGTGGCAGC GCTTTGCAGG
GCCCGGACGG AAGGAATGGA AGTCAAATCG CAGGAAATCA TTTTTCGCCC TGGCCCGTTA
CGCGGGGGCG AATATGATTT TCCGATAGGC ACGGCAGGCA GTGTTACCTT GGTGCTCCAG
GCAGCGCTTC CAGTTGCCTT GGCATGCGGA GAAAAGGTGC GGATGAACAT TTCGGGTGGG
ACCGATGTTC GCGCTGCGCC GCCCCTGGAT TACTTCCGCT ACGTATTGCT GCCGCTGGTT
TATAGCATGG GCGCCAGGGC GAAGATCGAA GTGTTGCTCC GGGGGTATTA TCCTCGCGGC
GGAGGAAAGG TGGTTGTGGA CGTAGAACCT TGCCTGCCTT TGCGTCCGGT GCTCCTGAAC
GCATCGGAAG GGCTGGAGGG TATAACCGGT TTCGTACACA TTTCAAACCT GCCCAAGCAC
ATCATCCACC GCATGGCGAA CGGAGCACTG GCGGAACTTT CGACTTTTCC CACCCCAGCT
GTTGGCCTGG AAGTATTCGG GAAGGATGAC GCGATAGGTG AGGGCGGAGC GGTGCTTTTG
ACCGCGCACA AGGAGCATAG CCGTCTGGGG GCATCTGCCG TCGCAGAAAG AGGCGTGCCA
GCCGAACGCC TCGGTGCTGA GGCGGGGCGG TGCTTGCGCG AGGAAATCCT GTCCGGCGCA
ACGCTGGATA TTCATGCGGC AGACCAAGTA TTGATCTACC TCGCGCTGGC GAGCGGCGTA
TCTTGCTTTC TCACAAGGGA ACTCTCCTCC CACGCCGCGA CAACCATTTG GCTGCTGGAA
CAGTTTCTGC CAGTCCGCTT CCAGGTCACA CAGGAGGCGC ATTTGATTCG CGTCCGCGCA
AAGCCGGAAT TTAATGGTAT GTCAAGCTTT TTGTGGCGAT AA
 
Protein sequence
MQEIDGSYGE GGGQLLRTSV ALAAITGQSV RVYNIRAKRS NPGLAPQHLT AVKAVAALCR 
ARTEGMEVKS QEIIFRPGPL RGGEYDFPIG TAGSVTLVLQ AALPVALACG EKVRMNISGG
TDVRAAPPLD YFRYVLLPLV YSMGARAKIE VLLRGYYPRG GGKVVVDVEP CLPLRPVLLN
ASEGLEGITG FVHISNLPKH IIHRMANGAL AELSTFPTPA VGLEVFGKDD AIGEGGAVLL
TAHKEHSRLG ASAVAERGVP AERLGAEAGR CLREEILSGA TLDIHAADQV LIYLALASGV
SCFLTRELSS HAATTIWLLE QFLPVRFQVT QEAHLIRVRA KPEFNGMSSF LWR
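
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and the sample is only the first row of the gene sequence, so its GC fraction need not match the 58% reported for the whole gene.

```python
def strip_blocks(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already-joined sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = (
    "ATGCAGGAAA TCGATGGTTC TTATGGAGAA "
    "GGTGGCGGCC AGTTGCTCCG CACGTCGGTT"
)

seq = strip_blocks(first_row)
print(len(seq), seq[:3], round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `strip_blocks` in the same way gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.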