Gene Nmul_A1733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1733 
Symbol 
ID3786210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1981098 
End bp1982237 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content56% 
IMG OID637811819 
Producthypothetical protein 
Protein accessionYP_412422 
Protein GI82702856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.580195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGCA TGTATCGCTT GCTTGCATTC TCGGGAGTGA CTGCTGAAGT AGTAGCCGTT 
CTCCTGGTCT GGTTGAGTGC GGATGACGCG CTTGCAGATT GGCAGGAAGG CGCGCTGCTG
TTTATGGCGG CTGTATTTCA CTCTGCTTCA TCGTATTACC TTGCGCGCAT GTTCTGGCAG
GCGCTGCCGC GCCGCTACAA GCTTCCTCCC CGTAGAAGTC TGGGATTGCT GTTTGCCTTT
CTATGGATTC TGCCGGTTTT CGGGGCGCTG GGCGTGCTGT GGAGCATAAC ACGTGCATTG
AAACAGCCCC GGACCCGTTC CGCGAAGAAT GTAAAGATCA TAATCCTGCC TGAGCTGCCT
TTTTCACCCC CTGTCATTTT CCCTGTTCCC CCTTACAGCC AGGGAGCCCT GCGCCAGATC
GTCCATTTTG CCCAGCGCTC GCTCAAGCGG TTGAAAGCGG TGATGGCAAC ACGGCATATG
TCGCCGAGAG AAGCCATGGA GATCTGGTCG AAGGCTACTC GCGACCCGAT CGATGACGTA
AGGCTGCTTG CCTATGCGAT GAAGGATGCC CATGAAAAGA GGCTCACTGA CCGCGTCCTG
GCTTTAACCG AAGCGCTGCC ACACCTTCCT CCACGAGCAC AGAATGCCTG CCGCAAGACG
ATCGCTGCGC TATGCTGGGA ACTGGTATAT CACAAGCTGG TACAGGGTGC TGTCAGACAG
CACTGGCTGA AAAACGCCCG CACACAAATG GAGGTCGTAT TGGGCTCGCC ATCGATTACG
CGGCGCGACG TTCCCTCTGC ATCTGTATCG GCTTCTGTGC GTGCTGCATC CGAAGCCTCT
CTGGCGTCGC CCAAGCATGA GATGGGTGAA GCGTCCTCTC TATCCGGGAG CGTGAACGCC
GACAGTTGGT TGTTGTATGG TCGGATTTTA TTGGAATCAG GTGAAGCCGC CCTGGCGAGA
AAGGCTTTTG TCAACGCACA AACCCATGGC GCGGATCAGC AACAACTGTT GCCATGGTTT
GCCGAAATCG CTTTCCGGCA GCGGAAGTTT ACCGAAGCAA AGGCCTGTTT GTCCGCGCTT
GCACGTGTTG GGGAGAAAGG GCGGGAACTG GCTCTGGTAA GAGCATGGTG GAATAAATGA
 
Protein sequence
MRRMYRLLAF SGVTAEVVAV LLVWLSADDA LADWQEGALL FMAAVFHSAS SYYLARMFWQ 
ALPRRYKLPP RRSLGLLFAF LWILPVFGAL GVLWSITRAL KQPRTRSAKN VKIIILPELP
FSPPVIFPVP PYSQGALRQI VHFAQRSLKR LKAVMATRHM SPREAMEIWS KATRDPIDDV
RLLAYAMKDA HEKRLTDRVL ALTEALPHLP PRAQNACRKT IAALCWELVY HKLVQGAVRQ
HWLKNARTQM EVVLGSPSIT RRDVPSASVS ASVRAASEAS LASPKHEMGE ASSLSGSVNA
DSWLLYGRIL LESGEAALAR KAFVNAQTHG ADQQQLLPWF AEIAFRQRKF TEAKACLSAL
ARVGEKGREL ALVRAWWNK