Gene Nmul_A2200 details


Gene Information

Locus tag: Nmul_A2200
Symbol:
ID: 3786225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2498321
End bp: 2499817
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 55%
IMG OID: 637812287
Product: ABC transporter related
Protein accession: YP_412884
Protein GI: 82703318
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component
TIGRFAM ID:
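
The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the gene length is taken to include the stop codon. Both of these are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2498321, 2499817)
print(length, protein_length(length))  # prints: 1497 498
```

The printed values match the Gene Length and Protein Length fields of the record.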


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.206971
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCCC CGAGAGAACT CACAGCCGAA ACAAGATGCG GCTCCTCCAT GGACTCCTCT 
CCCCCAACGG CAACCGATGA TATCAATAGC GCAGATATTG CGCTTCGCGT ATCGGCCCTG
TCAAAGACCT ATCGCATTTA TGGCCGTTCG CAGGACCGGC TGCTGCAAGG CTTGTGGGGT
AACCGGAAGC AACTGTATCG GGAATTCCGG GCGCTGGACA GCATTTCCTT CGAGATTCGT
CGCGGGGAAA CTTTCGGCAT CATCGGTCCG AACGGATCCG GGAAAAGCAC GCTGCTGCAA
CTGATCGCGG GAACGCTGAC ACCCACTCAG GGTGAAATAT ATATCAAAGG CAAAGTCTCG
GCTTTACTTG AGCTGGGAGC CGGCTTCAAC GGGGAATTTA CTGGCCGCGA GAATATCCGT
ATGGCGGCAT CGATTGCGGG ATTGAATCCT GGCGAGATCG GACAACGCCA CGAGTGCATC
GCCGCTTATG CCGACATTGG CGATTTCCTC GATCAACCGG TAAAGACGTA CTCAAGCGGC
ATGTATGTAC GCCTTGCATT CGCCGTCGCC ATCTCGGTCG AGCCGGAGGT ACTGATCATA
GATGAAGCAC TGGCGGTGGG GGACATGGAA TTTCAGGCGA AGTGCATGGT GACATTGAAG
CAGATGCAGG AACGCGGCAC CACCATTCTG TTTGTCAGCC ATGATGTGGG AGCGGTGAGC
GCCTTATGTA AGCGGACCCT TTATCTCAAG CATGGCCGGG CACTGGAAAT CGGTCCTACT
CCTGACGTCG TTGCACGCTA TATTCGTGAA GTACAGGAAG CAAACAACCG GAAAATAAAT
GTAACCGTTT CTGAAAACAA CGATTCGAGA ACCACGAGCC CATCCGACGC TTCTGAGACC
GGGATCGCTC CCTCCCACCG TACTGCTTCC CTCACCAGCG CTGAGGCCGC ATCTGGAGCC
GCTTCTGGAT TCTTGACGGC AGCAAGCCCG TTCCGGAGCG TGGAGAAGAA ACATCTTGCA
CGCTTTGCTG AAAATGCAAA TCACTGCCGC TCGGGAACCG GCGATGTGCG TGTAATTTAC
GCGGAAATGA CAGATGATGA AGGACTGCCG GTACGCTCGG CAGAATTCGG ACAATCAGTC
CTCATACGGA TCATCGTTGA AGCCGCCCGC ACCTGTACTT TTTCAGTCAA CTACAAAATT
TGCGATAAGA ATCGGACGCC GGTGATAGGA GCGGACTTCC TCATGCAAAG GCAGGCGCTG
CTGACCCTGG AGCCGTCGCA GCAGGCCGAG GCGCTTTACC GGACATCGCT CCCGCTGACC
GACGGCAGAT ACAGCTTGAG AATTTCACTC ACACACCCCG TCAACGCCCA TCAACAGGCA
CTGTTCTTCG ATATCGTGGA GATCGCCCAT GTATTCGAGG TGCTGCCCAA TCCGACTGCA
AAGTTCTGGA CACAGGTTTA TCTGCCGAAT ACGCTCGATG TAAGGGTTCT GGAATGA
 
Protein sequence
MTAPRELTAE TRCGSSMDSS PPTATDDINS ADIALRVSAL SKTYRIYGRS QDRLLQGLWG 
NRKQLYREFR ALDSISFEIR RGETFGIIGP NGSGKSTLLQ LIAGTLTPTQ GEIYIKGKVS
ALLELGAGFN GEFTGRENIR MAASIAGLNP GEIGQRHECI AAYADIGDFL DQPVKTYSSG
MYVRLAFAVA ISVEPEVLII DEALAVGDME FQAKCMVTLK QMQERGTTIL FVSHDVGAVS
ALCKRTLYLK HGRALEIGPT PDVVARYIRE VQEANNRKIN VTVSENNDSR TTSPSDASET
GIAPSHRTAS LTSAEAASGA ASGFLTAASP FRSVEKKHLA RFAENANHCR SGTGDVRVIY
AEMTDDEGLP VRSAEFGQSV LIRIIVEAAR TCTFSVNYKI CDKNRTPVIG ADFLMQRQAL
LTLEPSQQAE ALYRTSLPLT DGRYSLRISL THPVNAHQQA LFFDIVEIAH VFEVLPNPTA
KFWTQVYLPN TLDVRVLE
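
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any analysis. The following is a minimal Python sketch, not a reference implementation, that strips the whitespace, computes GC content, and translates a coding sequence. It assumes the usual codon assignments, which translation table 11 shares with the standard code for sense codons; the two differ in which codons may act as start codons, and this sketch does not model that. All function names are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; table 11 uses the same
# amino-acid assignments for sense and stop codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
head = clean("ATGACAGCCC CGAGAGAACT CACAGCCGAA")
print(translate(head))  # prints: MTAPRELTAE
```

Passing the full gene sequence through `clean` and `translate` allows a comparison with the listed protein sequence, and `gc_content` can be checked against the GC content field in the same way.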