Gene Nmul_A1734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1734 
Symbol 
ID3786211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1982237 
End bp1983508 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content55% 
IMG OID637811820 
Producthypothetical protein 
Protein accessionYP_412423 
Protein GI82702857 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.996775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAC CCTGGCTGGA AGGTATAGAG GTTGTCGATG TAAACACCCC CGTATCTGCG 
TGGGGCGGCT GGTTTGAAAC ATTCGTTATA ACTGCTGTTG TAATTGCGGC ATCATTCATT
ACCCAGCAGG CGGATCCTTT CCGATTGAGT GGCGGATTCC CCTGGGCAGT GCTGGCACCC
CTGCTCGCGG GATTACGGTA CGGGTTTGTC TTTGGTTTTG TCAGCGCGCT GCTGACACTC
GCAGTCCTTG GCGTCGCCAT CGACCAGCAA TGGCAGGCCG CGAAGAGCTT TCCGCTGCCT
TGGGCGATAG GCGTGGTGGT GGTGGCGATG GTGGCAGGGG AGTTTCGTGA CATGTGGGGG
CGTCGCCTGC ACCGGCTGGA GGGCGCGTAT CAATACCGCG CCGAGCGTCT TGAGGAATTC
ACGCGCAGTT ACCAGTTATT GCGCCTCTCG CATGACCGTC TCGAACAGAC CGTTGCCAAC
AGTGGCTTTT CCCTGCGTGA AGGCATCATG CACCTTCAAT CCACGCTGGA CGCCATCGAT
GGAATGACGG AAAGCTCGCT GCAAAAGCTT ATTGAATTTG TGGCGGAATA TGGTGCATTG
ACTCAGGCCT GCATCATCGG GATTACTGCC GACCGCATTG ATACTTCCAA CGTTCTGGCC
TGTGTGGGAG AGCGCTTTCC CATCGATGTG ACTGATCCAG TGCTGAGAAT GGCGCTCGAC
AGCGGCGAAC TGGCCACGTT GAATCTTCTG CAGGAATCAG AGATGGACCA GGCCCAGCTT
CTGGCTGTGG TACCTTTAAC CGACTCCGTC GGCGAAATAA ATGCCGTGCT GGCAGTGCGT
TCCATGCCTT TTTTTTCGTT TCATGAAAGC AATCTCAAAC TTATCGCGGT GCTGGTGGCC
CACGGCGTGG ACCATCTCCG CTTTGGAACT GCGAGGCCAT CGGTTCGTCG GTTTATTGCT
TCATTTGAAC GGGCATATCA GGATTTTTCG CGCTTCAAGC TCGATACCGT GCTCCTGAGA
TTATCCGGAA ATCCGGAGGA GGTGCGGAGC GTTCACGAAA AGCTGCGGTT TTCGATTCGT
GCCATCGACT TTATCTGCCT TGCGCGTGAA AAGGATCAGT ATGTCGTCTG GGCGATGTTG
CCACTGACGG ATATTACTGG GGCACGGGCA TGGGCGCAGC GAGTAGCCGA TATTCCCGCA
ACAACCGCGC AGGAATGGAT GTCTATCAAT GAAATTGATC CGCAAAGGAT CCGTTCCCTG
GAGCAGGGGT GA
 
Protein sequence
MKKPWLEGIE VVDVNTPVSA WGGWFETFVI TAVVIAASFI TQQADPFRLS GGFPWAVLAP 
LLAGLRYGFV FGFVSALLTL AVLGVAIDQQ WQAAKSFPLP WAIGVVVVAM VAGEFRDMWG
RRLHRLEGAY QYRAERLEEF TRSYQLLRLS HDRLEQTVAN SGFSLREGIM HLQSTLDAID
GMTESSLQKL IEFVAEYGAL TQACIIGITA DRIDTSNVLA CVGERFPIDV TDPVLRMALD
SGELATLNLL QESEMDQAQL LAVVPLTDSV GEINAVLAVR SMPFFSFHES NLKLIAVLVA
HGVDHLRFGT ARPSVRRFIA SFERAYQDFS RFKLDTVLLR LSGNPEEVRS VHEKLRFSIR
AIDFICLARE KDQYVVWAML PLTDITGARA WAQRVADIPA TTAQEWMSIN EIDPQRIRSL
EQG