Gene Nmul_A0867 details


Gene Information

Locus tag: Nmul_A0867
Symbol:
ID: 3784437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 986247
End bp: 987602
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 57%
IMG OID: 637810949
Product: hypothetical protein
Protein accession: YP_411562
Protein GI: 82701996
COG category:
COG ID:
TIGRFAM ID:
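
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 986247
end_bp = 987602

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the computed values agree with the listed gene length of 1356 bp and protein length of 451 aa.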


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.886139
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACTGA ACCCATCATT GATCCCTCAT TCGTCAGAGC AGCACGTTGA CTGGGAGAAA 
GCGCTGCATG GCGGCAACGA TGCGCTGCTG ACGCAACTTG CAGCCGATGA TGATCCCGCT
GCTGCTCTTC TGGAACTTGA AAGCTACATC AATGGGATCT ACCGCGGCAG CCCGTCTCCC
TTATCCAAAC CGCTGCCGGA TATCCGCTTG GAGGTGCGAC GGGCGCTACC CTATGTGGAT
GAAATTCGGG ATCGCCTGGG ATGGCAGGTG CGGGATTTCG ATCTTGGGTT GCTGCGCTTG
ATTCGTGGCT CGAGCACGTT ATCGCCCGTG CTGGGGGCAG GGGTCTCGAT GGATGCAGGG
GCGCCCTCCT GGCCCGAACT GGTGCGTTTG ATGCTGGAGG AAACGCTCGA CAAGGGTCTG
GAGTTCTACG AGTCCGTTCC CGCCGCTGAC AATCCGGCTC AGCCGCCTAT CGAGTTTCTT
CCCGACGGCA CGGTGCGCAC GGGCGGAACC GGAACCTGGC GTTTCGAGCA ACGCGTCAGC
GAAGTGAAGC GCTATACGGC TGAGCAGGAA CAAACGGCGA GGAACGTCCT TGCAGAGGTC
AAGGCCAAAG GTTCTTCAAC CGATGTCGAG ACGCTCATGC ACGGGGCGCA GGTCTGCTAC
GATCTTTGCG GCCAGCATCT TTTTCGTCTG CTCACGAAAA TCATCTATAC ACGTGCGAAA
GAGCCAAGCG AAGTCCATCG GGTTATTGCC GAACTGGCGC AGGCGCAAGA GGTACCCGTG
CGCGGCCCTG GTTTGTTTCC CGGATGGGAT TCAATTATCA CCTACAATAT CGATGCACTA
ATGTCGGAAG CGCTCGCGGA GCAGAAAATA CCGCACGCTG CCTGGGCGAT GAAGGGTGAC
AAGCTGCGGG GCAATCCGGA TGAACTCGCT CAGAAGAGTT CATGGCATGA ACCTATCTTT
CATCTTCACG GTTTTTCGCC GCGGCGGCTG TTCATGATCA CGAATGTGCG CTTTGTATTT
TCCACTTCTC AGTACCTCAC GACGTACAAA GGGCCGCGAT CAAGGATATT GGAAGCGGTC
TACGACGAAT TTCTGGCGAA TCCTGTGCGC ATTGCGCTTT ATATAGGGTG TTCGTTTGCC
GATGAAGCGA TGAACGGCCT CCTGCGTGAG GCCTTCGCGG AATACCCGGG GCGCTATCAC
TATGCATTGC TGAAATGGCC GCGAGACAGG AAAGGAAAGG AGCCGGACAG GAGCGAGATC
GCCGCCGAGT CCGCAAAATA TCTCGAGTTT GGGGTGCGGC CGATTTGGTT CGATGATTTC
GCCGAATTAC CCGGGTTGAT CCGGCAGCTG CAATGA
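
The GC content reported for this gene is the fraction of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as text in the 10-base blocks shown above; `gc_content` is a hypothetical helper, not part of any database tool, and the database may round or compute the figure differently.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to block the sequence.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example on the first 10-base block of the gene sequence.
first_block = "ATGCAACTGA"
print(f"{gc_content(first_block):.0%}")
```

Passing the full gene sequence to the same function gives the value for the whole gene.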
 
Protein sequence
MQLNPSLIPH SSEQHVDWEK ALHGGNDALL TQLAADDDPA AALLELESYI NGIYRGSPSP 
LSKPLPDIRL EVRRALPYVD EIRDRLGWQV RDFDLGLLRL IRGSSTLSPV LGAGVSMDAG
APSWPELVRL MLEETLDKGL EFYESVPAAD NPAQPPIEFL PDGTVRTGGT GTWRFEQRVS
EVKRYTAEQE QTARNVLAEV KAKGSSTDVE TLMHGAQVCY DLCGQHLFRL LTKIIYTRAK
EPSEVHRVIA ELAQAQEVPV RGPGLFPGWD SIITYNIDAL MSEALAEQKI PHAAWAMKGD
KLRGNPDELA QKSSWHEPIF HLHGFSPRRL FMITNVRFVF STSQYLTTYK GPRSRILEAV
YDEFLANPVR IALYIGCSFA DEAMNGLLRE AFAEYPGRYH YALLKWPRDR KGKEPDRSEI
AAESAKYLEF GVRPIWFDDF AELPGLIRQL Q
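
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first three blocks of the gene, assuming the amino acid assignments of the standard genetic code, which translation table 11 shares (the two differ in permitted start codons). `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: standard amino acid assignments, as also used by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to block the sequence.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCAACTGA ACCCATCATT GATCCCTCAT"))
```

The result for these 30 bases can be compared with the first block of the protein sequence, MQLNPSLIPH.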