Gene Nmul_A2087 details

Gene Information

Locus tag: Nmul_A2087
Symbol:
ID: 3786091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2380050
End bp: 2381135
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 49%
IMG OID: 637812176
Product: hypothetical protein
Protein accession: YP_412773
Protein GI: 82703207
COG category:
COG ID:
TIGRFAM ID:
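
The start, end, and length fields above agree with one another under the usual 1-based, inclusive coordinate convention. The following is a minimal sketch of that arithmetic, assuming inclusive coordinates and a final codon that is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2380050, 2381135)
print(length)                  # 1086
print(protein_length(length))  # 361
```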


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTATTGG CAGCCTTTTT TCTTCTAAAT GCAAGCCCCT CCGCAGGAGC AGGGGGTGAC 
CTTGACGATA GACTGCGCTC GCCACTGACA CTCGCTGCCT ATGTTGAAGG TTACTACAGT
CACGATTTCA ACGAACCGGT AAATAACGCT AAACCTCCCT TTCTCACCAG CTTCAGCAAA
AGCAATCAAC CCGCAGTAAA TCTCGCCTTC ATAAAGGCAT CGTACGCAAC ACCCAATATC
AGGGCAAACT TTGCGCTCGC AGCAGGCACC TACATGAACA CGAACTATGC TGCAGAACCT
GGCATTCTGG GCCATTTATA CGAAGGCAAC ATCGCCTTGA GACTATCCGG CGAAAATAAA
CTCTGGCTGG AAGCTGGCGT TTTCCCTTCG CATATCGGCT TTGAAAGCGC AACAGGGAAA
AACAATTGGA CCCTGACGAG AAGCATGGCG GCGGAGAACA CACCCTATTT CGAGTCAGGC
GTCAGGATCG ACTTCACTTC GGCTGATGAT AAATGGTTTT TAAGCGGATT GGTGCTGAAC
GGCTGGCAAC ATATAAAACC GGTGGACGGA AACACGCTTC CCGCCTTCGG CACACAGATT
ACCTACCGAC CTTCTCCTGA AATAACGTTC AATAGCAGCA CCTTTGCGGG CAGCGACAAG
CCCGACAGTC ACCGGCAAAT GCGTTACTTC CATAATTTCT ACGGAATTTT CAAACTGAAT
GAGGAGCTTG CAGCGACTGT TGGATTCGAT ATCGGCGTCG AACAAAAAAG CAAGCATTCG
GGCAGCCTTA ACACGTGGTT CAACCCTACA GTCATTCTGA GATATGTGCA AACGCCCAGA
ACAGCGGTTG CTGTAAGGGC AGAATACTAC AATGACAAAC AAGGCGTAAT GATTGCATCC
GCAAAGCCTC ATGGTTTCCG GACATGGGGT TTTTCAGCCA ATTTCGATTA CAACATCACT
GACAATCTGC TGTGGAGGCT TGAGGCCAGA ACGCTGCTCA GTAAAGACGA TATTTTTGCT
GGTAAAAATG GTACTTCCAG AGATAGCGCC ACTTTTTTCA CTACGTCGAT CGTCGCCCAT
TTTTAA
 
Protein sequence
MVLAAFFLLN ASPSAGAGGD LDDRLRSPLT LAAYVEGYYS HDFNEPVNNA KPPFLTSFSK 
SNQPAVNLAF IKASYATPNI RANFALAAGT YMNTNYAAEP GILGHLYEGN IALRLSGENK
LWLEAGVFPS HIGFESATGK NNWTLTRSMA AENTPYFESG VRIDFTSADD KWFLSGLVLN
GWQHIKPVDG NTLPAFGTQI TYRPSPEITF NSSTFAGSDK PDSHRQMRYF HNFYGIFKLN
EELAATVGFD IGVEQKSKHS GSLNTWFNPT VILRYVQTPR TAVAVRAEYY NDKQGVMIAS
AKPHGFRTWG FSANFDYNIT DNLLWRLEAR TLLSKDDIFA GKNGTSRDSA TFFTTSIVAH
F
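
The gene and protein sequences above are printed in blocks of ten characters. As a rough sketch of how the two relate, the code below joins such blocks, translates the DNA codon by codon, and computes a GC fraction. It assumes the sequence is given in the coding orientation and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
first_blocks = join_blocks("ATGGTATTGG CAGCCTTTTT TCTTCTAAAT")
print(translate(first_blocks))  # MVLAAFFLLN
```

The printed residues match the first block of the protein sequence listed above. Applying `gc_fraction` to the full joined gene sequence would be the way to check the reported GC content.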