Gene Nmul_A2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2008 
Symbol 
ID3784499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2307292 
End bp2308578 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content57% 
IMG OID637812097 
Producthypothetical protein 
Protein accessionYP_412695 
Protein GI82703129 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATATCC ACCGCTGCAT TCAGCGCGAG GTCTGGAGCA AGGAGGCCCT TGGCGATCGG 
ATCATCATGC GCGTCGCGTT GCCGTTCGTA CCGTTCATAC TTGCTGTTCT GGTAGCGGTT
TTCACAGCGA TCAATGGTCG CGCTATCAAG AAGCGGACGG GGAAAGGTTT AATCCGCCAG
ATCCAGGAGC AGATTGAGCT TGCGATACGC TTTGCGATTC TACCCCCATG GTATTACATC
TTCGAACTGC ATGACGATGA CAAGAAACTG CATGCAGGGG AGTATCTGAA CCGGCTTGAG
ACGAAAGGGG GACTCTATCG CTTCCTGCGC GATAACAACG GTGGTCTCCC TATTCCCGCG
GAACGCAGCA CCGGCTCCAT AAAGGATAAG GGACGCTTCC GGGCTCGCTG TCGTGCGCAT
GGGATCACAA CTGCTCCCGT TTTTTTTAAT GTGGCGCAGG AAAAGATTAC GGCGGTGGAT
TGGGGTTTGC CGGAACTACC GGCACTGCCC GAATTACCCG AGCGCGATCT CTTCATAAAA
CCCGTTCACG GACAGGGCGG GAAAAAGGCC ACGCGCTGGG ATTATCTCGG TTCCGGGCAA
TTCCGCCGCA ATGACGGCGA AGTTGCTACT GGAAGTCAAG TGTTGGAGCG GCTGCGGCAC
GCATCGCGGC ACGCGGCTTT CCTGGTGCAG CCGCGGCTTG TGAGTCACTG TGAGATTGCC
GATCTGGCCA ATGGAACACT TTCCACCGTT CGCGTGATGA CATGCCGTAA CGAAAAGGGG
GAGTTCGAAG TGACCAATGC GGCTTTTCGC ATGGCGCGAA ACAAGCTGGT CGTCGTCGAT
AACTTTCACG CTGGGGGTAT TGCAGCCAAT GTCGACATTT CCACCGGTAC GCTCGGAAGG
GGTACGCGCG GGGCTTGGGG AGCCACGGGC GACGGATGGT ATGAACAACA TTCCGAAACC
GGGGCGCAGA TCCAAGGTCG CAAGCTGCCG TGCTGGTTTG AGTTGGTCGA GCTGGTGCAA
TATGCGCATG GCGCCGCGTT TTCTGACCAG GTTGTCATTG GATGGGATGT TGCTCTGCTC
GACAGTGGTC CATGCATCAT GGAAGCCAAC AAGGCGCCCG ATCTGGACAT TATCCAGCGG
GTGGAAGGCG TGCCCCTGGG CAATCAGCGC CTGGGAAAAC TTCTGGCATT CAATCTGATG
CGTACCGTCG AGGCGCAGCA TGCACCTGCA GCGGGCGCCC GAAAGAGCGC CGATAGTTCG
CTGGGAACGC AAACGGAAAA ACCGTGA
 
Protein sequence
MYIHRCIQRE VWSKEALGDR IIMRVALPFV PFILAVLVAV FTAINGRAIK KRTGKGLIRQ 
IQEQIELAIR FAILPPWYYI FELHDDDKKL HAGEYLNRLE TKGGLYRFLR DNNGGLPIPA
ERSTGSIKDK GRFRARCRAH GITTAPVFFN VAQEKITAVD WGLPELPALP ELPERDLFIK
PVHGQGGKKA TRWDYLGSGQ FRRNDGEVAT GSQVLERLRH ASRHAAFLVQ PRLVSHCEIA
DLANGTLSTV RVMTCRNEKG EFEVTNAAFR MARNKLVVVD NFHAGGIAAN VDISTGTLGR
GTRGAWGATG DGWYEQHSET GAQIQGRKLP CWFELVELVQ YAHGAAFSDQ VVIGWDVALL
DSGPCIMEAN KAPDLDIIQR VEGVPLGNQR LGKLLAFNLM RTVEAQHAPA AGARKSADSS
LGTQTEKP