Gene Nmul_A2202 details

Gene Information

Locus tag: Nmul_A2202
Symbol:
ID: 3786227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2500755
End bp: 2502644
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 58%
IMG OID: 637812289
Product: hypothetical protein
Protein accession: YP_412886
Protein GI: 82703320
COG category:
COG ID:
TIGRFAM ID:
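
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both are assumptions about this record's conventions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2500755
end_bp = 2502644

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the 1890 bp and 629 aa reported in the record.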


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGTAC TCATAACCAA TAACACGCTC GACACCCGCG GCGGCTCCGA ACTATATGTC 
CGCGACCTGG CCTTGGCGCT GCTGCGGCGC GGCCATAATC CAGTGGCCTA TAGTACCCGA
CTTGGAGCGG TTGCCGAGGA GCTGCGCTCG GCGACCATTC CCGTCATTGA CGATCTCAAT
CTGCTGACGG TTCCACCCGA CATTATTCAT GGCCAGCATC ATCTCGATGC AATGACGGCC
ATGTTATATT TTCCCGATAC GCCCGCGGTC TACTTCTGCC ATGGCTGGCT GCCGTGGGAA
GAAATGGCGC CACGCTTCCC CACGATCCGG CATTATGTTG CTGTGGACGA TCTCTGCCAG
GAGCGACTGC AATGCCTCCA TGGCATCCCC CCCGAGCGCA TTCGTGTAAT ACGCAATTTT
GTCGACCTGC AGCGATTCGG CCTGCACGCG GATTTACCTG CCATACCACG CAAGGCCCTG
GTTTTCAGTA ATTACATCGG TGAGGACGGC TGCCTGGGAA TTCTGCGCCA AGCCTGTGCC
GCGCGCGGCA TCGAACTCGA TGCCATCGGC CTTTCTGTCG GGCACAGTGA AGCCCGGCCT
GAACGGATTC TCGGCTGCTA TGACATTGTC TTTGCGAAAG CACGTTGCGC GCTTGAAGCC
CTTGCCAGCG GAACGGCTGT AATAGCCTGC GACGCCGCCG GTCTCGGGGG CATGGTCATG
CCGGACAACT ATGAAACCTT TCGCGCGCTC AATTTCGGTA TACGGAGTCT GCGTAATCCC
ATTACCCTGG ATACCATCAC GCGGGAACTC GACCGGTACG ATGCGCCCGG GGCACGCGAG
GTTACCCGGC GCGTACGATC AGAAGCAGGT ATCGATCCGG CGATCGAACG GATTCTCCAA
GTTTACCAGG AGGCAATGGA AGCGCACGTC CGCGAATCCA GGGAGAGGAA ACCGAAACGG
GAAAGCCCCT CCGACTCCCG CCTTCAGTCG GCAAGCAGAT ATCTCCGCGA GATTGCCGAT
TTCACCAAAA AGCGCCATCA GGTGGAGCAG GAGAAACATC TTGCTGTGGC TGAAGCCGCA
GCGCAGCGTA CACGCGCTGA GCAGGCCGAG TTCGAGTTGG GGAAAATCCA CAATTCGCGT
CTCTGGCCAC TCGTCATGTT GCTCTATCGG CTCAAGTATA GGTTATGGAA TCGTCCCATT
GCCGTTCTTC GGGCACGTCG AGAAAATCGA TTCCGGGAGA ATCGAAATGA CCAAGACGGG
AGCAAGGCTG GAGAAAACAA CCATTCGGCC GATCGCGTCA GCGTATTCGA GCAAATCTAT
GTCCGCAATG CCTGGCAGAG TCCTGAATCC CGGTCAGGGC CGGGCTCCAC GCTGGAACGA
ACCGAGATAC TGCGGTGCGA ACTTCCTCCT CTGCTGGCCC GCCTCGGGGT TCGCACACTG
GTGGATGCAC CTTGCGGGGA TTGCAACTGG CGGCAGCATA CTGTAATCGA TCTCGATGCA
TACATCGGTG TCGATATCGT TCCTGCGCTG ATAGAGGAAA ACCGGCAGCG CTTTCCCCAT
TCCAACTGGA GATTCGAGGT TGCCGACCTG GTAGAAGATG ACTTGCCGCG CGGTGATGCA
GTGCTCTGCC GCGATGCCCT GATCCATTTG TCGCTGACGG ATATTCTGCG GGCGCTTTCC
AATATCCGCC GCTCCGGGGC AAAGTACCTG CTGGCAACCA GCCATGAAAC GACCAGCGCC
AACACAGACA TCGCCACGGG CGGGTGGCGT TCCGTGAACT TGACGCTGGC GCCTTTCAAC
CTTCCCCCTC CATTGGAGCG TATCGTCGAA AATCCGCAAA CCGGAAAGAT ACTGGGCATA
TGGCTGCTGG CAGAGATACC GCTCTCCTGA
 
Protein sequence
MRVLITNNTL DTRGGSELYV RDLALALLRR GHNPVAYSTR LGAVAEELRS ATIPVIDDLN 
LLTVPPDIIH GQHHLDAMTA MLYFPDTPAV YFCHGWLPWE EMAPRFPTIR HYVAVDDLCQ
ERLQCLHGIP PERIRVIRNF VDLQRFGLHA DLPAIPRKAL VFSNYIGEDG CLGILRQACA
ARGIELDAIG LSVGHSEARP ERILGCYDIV FAKARCALEA LASGTAVIAC DAAGLGGMVM
PDNYETFRAL NFGIRSLRNP ITLDTITREL DRYDAPGARE VTRRVRSEAG IDPAIERILQ
VYQEAMEAHV RESRERKPKR ESPSDSRLQS ASRYLREIAD FTKKRHQVEQ EKHLAVAEAA
AQRTRAEQAE FELGKIHNSR LWPLVMLLYR LKYRLWNRPI AVLRARRENR FRENRNDQDG
SKAGENNHSA DRVSVFEQIY VRNAWQSPES RSGPGSTLER TEILRCELPP LLARLGVRTL
VDAPCGDCNW RQHTVIDLDA YIGVDIVPAL IEENRQRFPH SNWRFEVADL VEDDLPRGDA
VLCRDALIHL SLTDILRALS NIRRSGAKYL LATSHETTSA NTDIATGGWR SVNLTLAPFN
LPPPLERIVE NPQTGKILGI WLLAEIPLS
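
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ only in permitted start codons), so a standard codon table is enough here. The function name `translate` and the helper constants are hypothetical and are not part of the record. Only the first and last blocks of the gene sequence are used, copied verbatim from the listing above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored so that sequences formatted in blocks of ten
    bases can be pasted directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGAGGGTAC TCATAACCAA TAACACGCTC GACACCCGCG GCGGCTCCGA ACTATATGTC"
# Last line of the gene sequence (30 bases, ending in the TGA stop codon).
last_line = "TGGCTGCTGG CAGAGATACC GCTCTCCTGA"

print(translate(first_line))  # MRVLITNNTLDTRGGSELYV
print(translate(last_line))   # WLLAEIPLS
```

The two outputs match the first twenty residues and the last nine residues of the protein sequence listed above.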