Gene Nmul_A0719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0719 
Symbol 
ID3786065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp834224 
End bp835924 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content56% 
IMG OID637810801 
Productglycoside hydrolase family protein 
Protein accessionYP_411418 
Protein GI82701852 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCAG TACAATTAGT CTTGCTGTGG CACATGCATC AGCCGGATTA TCGCAATTAC 
GAGACCGGCG AATTCATGCT CCCATGGGTA TACCTGCATG CGATCAAGGA TTACAGCGAT
ATGGCTGCTC ACCTTGAGGC TCATCCCCAC ATGAAGGCGG TGGTCAATTT CGTGCCGGTA
TTGCTCGATC AGCTCGAGGA CTATGCCACC CAGTTTGCTT CCGGAGAGAT CCGGGATCCG
CTGTTGCGGT TGCTGGCTAT CCCCAGCCTG GATGACGTTT CCGAAGAAGA TCGGTTACGC
GCCTTCGACA GTTGTTTTCG CAGCAATCAT CTGACCATGA TGCAGCCATA TCCCGCGTAC
AAACGCCTGT ACGACATCCA TGAAATGTTG AGGGAATACG GAGATGGGGA ATTGACCTAT
CTTTCCGGCC AGTATCTTGC GGATCTGCTT GTCTGGTATC ACCTGGCTTG GACGGGTGAA
AGTGTGCGGC GAAGCAGTGA GGTTGTGATC CAGTTGATGA CCCAGGCGAA GGGTTTCAGC
TATGCAGACC GAATGCAGTT GCTCGATGTG ATCGGAGAGG TTGTGCAGGG CTTGATTCCT
CGTTATCGCA AGTTGGTGGA GTCGGGGCAG ATCGAGCTTT CCACCACACC GCATTACCAT
CCGCTCGCAC CCTTGCTGAT CGATTTTTCA TCCGCTCGCG AAAGTGTGCC GGGTTCGGCT
CTGCCGATCG AACCGGTTTA TCCCGGCGGG CGGAGCCGCG TCGCATCCCA ACTGGTTTCC
GCCATTGAAA GCCATGCCGC GCGTTTTGGC GCAAGACCCG AGGGAGTATG GCCGGCGGAA
GGAGCGGTAT CGGCACCGCT ACTTGAGATA CTGGGTGAAA AAGGTTGCCA GTGGTGCGCC
AGTGGCGAAG GGGTGCTGGC GAACAGTTTG CGTCACTCCT ATCCGGGCGA GCCTCTGCCG
GAGAGGAGCC GCTTTCTTTA CCGGCCATAT CGGGTTGACG GCAAATCAGG CGATGTCATT
TGCTTCTTCC GGGACGAGAA GTTGTCGGAC ATGATCGGTT TCGAATATGC CAAGTGGTTT
GGTCGGGACG CTGCCGAGCA CCTGGTGCGA TCTCTGGAGG AGATCGGGCA CAGCGCATTG
CCGGGAGAGA AACCGGTGGT GAGCGTGATT CTCGACGGTG AGAATGCCTG GGAATACTAT
CCTTACAATG GATATTATTT CCTCAATGAT CTGTACGAAA TTCTGGAAAA CCATCCTTCC
ATCCATTCCA CGACCTATCG CGACTATATC GCGTCCGAGA ACGAGAAGGA AGCGGCCCGC
CTGCCGCTTC TGACCGCCGG CAGCTGGGTG TATGGAACTT TCTCCACCTG GATCGGGGAT
CGGGACAAGA ACCGTGCGTG GGATCTGCTG AGCGCCGCCA AGCACAGTTA TGATCTTGTC
ATGCAAAGTG GGCGCCTGAC CCCTGACGAA AGAAAGAAAG CGGAGCGGCA GCTTGCGTCC
TGCGAAAGCT CCGACTGGTT CTGGTGGTTG GGGGACTATA ATCCGCCTTA CGCGGTATCG
AGCTTCGACC AGTTATTCCG CGACAATCTT GCCAATCTTT ATGTCCTGCT GAAATTGCCC
GTACCCATTT CCATCACTGA GCCAATCAGC CACGGAGGAG GAGTGCATGA AACGAGTGGT
GCGATGCGGC GTGCTTCCTG A
 
Protein sequence
MQPVQLVLLW HMHQPDYRNY ETGEFMLPWV YLHAIKDYSD MAAHLEAHPH MKAVVNFVPV 
LLDQLEDYAT QFASGEIRDP LLRLLAIPSL DDVSEEDRLR AFDSCFRSNH LTMMQPYPAY
KRLYDIHEML REYGDGELTY LSGQYLADLL VWYHLAWTGE SVRRSSEVVI QLMTQAKGFS
YADRMQLLDV IGEVVQGLIP RYRKLVESGQ IELSTTPHYH PLAPLLIDFS SARESVPGSA
LPIEPVYPGG RSRVASQLVS AIESHAARFG ARPEGVWPAE GAVSAPLLEI LGEKGCQWCA
SGEGVLANSL RHSYPGEPLP ERSRFLYRPY RVDGKSGDVI CFFRDEKLSD MIGFEYAKWF
GRDAAEHLVR SLEEIGHSAL PGEKPVVSVI LDGENAWEYY PYNGYYFLND LYEILENHPS
IHSTTYRDYI ASENEKEAAR LPLLTAGSWV YGTFSTWIGD RDKNRAWDLL SAAKHSYDLV
MQSGRLTPDE RKKAERQLAS CESSDWFWWL GDYNPPYAVS SFDQLFRDNL ANLYVLLKLP
VPISITEPIS HGGGVHETSG AMRRAS