Gene Nmul_A2120 details


Gene Information

Locus tag: Nmul_A2120
Symbol:
ID: 3786658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2415673
End bp: 2417145
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 51%
IMG OID: 637812208
Product: OmpA-like transmembrane domain-containing protein
Protein accession: YP_412805
Protein GI: 82703239
COG category:
COG ID:
TIGRFAM ID:
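
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2415673, 2417145)
print(length)                  # 1473
print(protein_length(length))  # 490
```

Both values agree with the Gene Length and Protein Length fields reported for this gene.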


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAATC CTGGAACAAC AGTACTTTTA TTCAAGATAC TTCTTGTCTG TTACCTGCTG 
TTCTCGGTCA TGCTGGTACG GGCCGGGGTG ACGGAACCAA ACCCGCGCCT GGAGGACGCT
TTCGACTTCA TGAATGTGTT GGCGAAAAAG GGGCTCCACG ACCTGAAGGA AGAGCGCTGG
AATGCTTATG GCCAGTTCAC CTACATTTCG AGCTGGAAAA GTGGATTTCC GGCTTCGTAT
ACCAACCTGA ACGGAAGTAT CAACTCATTG TTGCCGGGAA GCGAAAGAAG CTTTACCGGA
ACGGCGACGC TGTACCTGGG GCTCAAGACA TGGCATGGCG GCGAGATTTA TTATGTGCCG
GAAATGATAT CCATGCGGCC CTTATCGGAC TTGAAAGGGC TGGGTGGCGC CATACAAAAC
TTCGAACTGC AAAAAGGCGG TACGGAAACT CCGATCTTTT ACCAGTCGCG TTTGTTTTTC
AAACAGACTT TTGGATTTGG CGGAGAGCGC ATACAACTCA CTTCCGATCC GATGCAGCTC
GGAACAACGG TGGATAGCAG GCGCCTCGTT GTCAGGTTTG GCAATTTCAG CGTGATTGAT
TTTTTCGATA AGAATTCATA CTCCGGCGAC CTGCGCAGGC AATTCTTAAA TATGGCCTTC
ATGACCTACG CAGCCTTCGA TTTCGCCGCG GATGCGAGAG GATATACCTG GGGCGGGGTA
GCCGAATATT TTCATGATGA CTGGACGTTC CGCTTCGGCC ATGTAGCCAC CGCCATCGAC
CCCAACCAGA TCCCTCTCGA TATGAGGGTA TTCAAATATT ACGGACAGCA GGTTGAAGTT
GAACGCCGCC ATTTGTTGAA CGGCTATCCT GGTGCTGTCA GGATACTGGC TTACCGTAAT
CATGAAAACA TGGGTAAATT CAGCGATGCC ATTGCTGCTT TCCGGTTCGA TCCCAATAAG
AATGCCACCA CCTGTACGGG CTTCAATTAC GGCTCGGGTA ATGCAGGAGC GCCGGATTTA
TGCTGGGCGC GCAACCCGAA CGACAAGATG GGCATAGGGA TCAATATTGA GCAGCAAGTG
CTGGATGGCG TCGGCTTATT TTTTCGCGGA ATGTACAGCG ACGGCAAAAC TGAAGTGTAT
TCCTATACCT CTACCGACAG ATCGATTTCA CTTGGCGCTC TGGTTAACGG CTTCCGTTGG
GGGCGGGACG GAGATTTGCT TGGCATCGGA TTTGCTGCGG GCTGGATTTC GGGCCAGCAT
GCCAAATACT TGAATATGGG GGGCGTCGAC GGATTTATCG GGGATGGCCG GATCAAGGCG
CGCGCGGAGG ATGTAGTGGA CATTTTCTAT AGCGTCAATG TGCTAAGTTC CCTTTGGGTG
ACCGCCGACT ATCAGCATAT TACCAACCCT GGTTTCAATG CCGATCGCGG TCCTGTGAAT
ATCTATGGGC TTAGAGTCCA TGCGGAATTC TAA
 
Protein sequence
MINPGTTVLL FKILLVCYLL FSVMLVRAGV TEPNPRLEDA FDFMNVLAKK GLHDLKEERW 
NAYGQFTYIS SWKSGFPASY TNLNGSINSL LPGSERSFTG TATLYLGLKT WHGGEIYYVP
EMISMRPLSD LKGLGGAIQN FELQKGGTET PIFYQSRLFF KQTFGFGGER IQLTSDPMQL
GTTVDSRRLV VRFGNFSVID FFDKNSYSGD LRRQFLNMAF MTYAAFDFAA DARGYTWGGV
AEYFHDDWTF RFGHVATAID PNQIPLDMRV FKYYGQQVEV ERRHLLNGYP GAVRILAYRN
HENMGKFSDA IAAFRFDPNK NATTCTGFNY GSGNAGAPDL CWARNPNDKM GIGINIEQQV
LDGVGLFFRG MYSDGKTEVY SYTSTDRSIS LGALVNGFRW GRDGDLLGIG FAAGWISGQH
AKYLNMGGVD GFIGDGRIKA RAEDVVDIFY SVNVLSSLWV TADYQHITNP GFNADRGPVN
IYGLRVHAEF
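
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how such a blocked sequence could be cleaned, translated, and checked for GC content follows. It uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; the table's alternative start codons are not handled, which is an acceptable simplification here only because this gene begins with ATG. All function names are illustrative.

```python
# Codon table in the NCBI layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGATAAATC CTGGAACAAC AGTACTTTTA"))  # MINPGTTVLL
```

Passing the full gene sequence to `translate` and `gc_percent` is one way to compare the result against the protein sequence and the GC content field listed in this record.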