Gene Nmul_A1807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1807 
Symbol 
ID3786358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2066832 
End bp2068493 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content60% 
IMG OID637811893 
Producthypothetical protein 
Protein accessionYP_412496 
Protein GI82702930 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCAAAC AAGCCCGCAA TAAAGAATCG AACCGGGGAC GCTCCGGCAC TTGGGAACCG 
CTCAAGATAC TGCCATTCAG GGCATTCTGG TTCGCCGCGC TTGGCTCCAA CATCGGAACG
TGGATCAATG GCGTATCCTC CGCCTGGGTA ATGACCGACC TGTCTCCCTC ACCGGTGATG
GTGTCCCTGG TGCAGGCGGC CACTTCGTTG CCCATGGTAC TGTTCGCGCT GGCTGCAGGT
GCGCTGACCG ACATTGTAGA CCGACGGCGT TATCTTCTCT TCACGCAGAT ATGGATGGCC
GCGGCTGCGG CAATGCTCAC CGTACTTGCT GCTATCGATC AGATCGATAT CTGGAACCTC
CTGATTCTGA CCTTTGCTTT GGGCATTGGC GCATCACTGG CAACTCCCGC GCTGAATATC
ACTGCCCCCG AGCTTGTTCC CAGGAGTATG TTGCCGGAAG CGGTCGCATT GAGTTCATTG
TCGATGAACC TGTCGCGTTC CCTCGGCCCA GCCATTGCCG GCGTATTGCT GGCCCAGATC
GGCCCATGGG CCGCTTATGG CCTGAATGCG CTCTCGTTTA TAGGCATGAT TGTCGTCCTC
TGGAGATGGA AGCGCGAACC GGAAGAGCGG TCGCTGCCAC CCGAACGCTT CTTCCAGGCA
TTGCGCGCCG GGGTACGTTA TGCTCATGTA GCATCGCCTT TCCGAGCGGT GCTGATTCGT
ACCACGGCTT TTATTCTCTT CGCAGCTTCG GGATGGGCGC TTCTTCCGCT GATTGCACGG
GTCGAACTGG GCGGGGGACC CGGAACTTAT GGACTCTTGC TCTCCTTCGT CGGTATTGGG
GCGGTGTGCG GTATCCTTGT TCTGCCCCGG CTGCATGAAC TCGCCTCCCG CGATCGCCTG
GTGCTGGCGG CAAGCCTGAT TTATGGGGCA ACGATCATGG CACTGGCAAT CCTGCAGAGC
GAAGAGATGC TTTACGCCAT CATGACGCTT TCCGGCGCGG CCTGGGTTAG CGTGCTCTGG
TCACTGCAGG TTACCGCACA AACCTCTGTT CCTGCCTGGG TTCGGGGACG CGCACTGTCT
CTCTATATCA TGGTCTTCTC CGCAGGCCTG GCATTGGGAA GCCTGTTCTG GGGATGGGTT
GCTGCCAGCA CTACCGTTCC CACTGCCCTC CTGCTATCCT CGGCCGGGAC GATGGTGGCG
GCGCTGGCTG TTCGCAATTT CAGTCTGGGC TCCCGGGAGG CTCCGGATCT TGCTCCTTCA
TACCATTGGC GGCCGCATCC TCCAGCAATG GAAGAACCTG ACTTGCGCCG GGGGCCCGTG
CTAGTCACTG TCGAATATGA GATTGGGCTG GATCAGCGGC GAGCCTTCCT GGAAGCAATC
CGCTCACTGG GAGCATCGCG GCGGCGCGAT GGGGCGTTTG CCTGGGGGGT CTTTGAGGAC
CTCGAGAAGC CGGGGCGTTA TATCGAATTC TTCCAGCAGG CCTCGTGGCT GGATCATCTG
CGCCAGCATG CACGCGTTAC CCGCGAGGAC CAGCGAGTGC AGGAAAACGT CAACCGCTTT
CATACGGGCA GCGAAGCCCC ACGCGTTTCA CACTTCATCG GTGGCACACC GACAGCGTCA
ACCGACAGCC CGGCGGCAAC GGGAGGCATG ACTGAAGCAT AA
 
Protein sequence
MTKQARNKES NRGRSGTWEP LKILPFRAFW FAALGSNIGT WINGVSSAWV MTDLSPSPVM 
VSLVQAATSL PMVLFALAAG ALTDIVDRRR YLLFTQIWMA AAAAMLTVLA AIDQIDIWNL
LILTFALGIG ASLATPALNI TAPELVPRSM LPEAVALSSL SMNLSRSLGP AIAGVLLAQI
GPWAAYGLNA LSFIGMIVVL WRWKREPEER SLPPERFFQA LRAGVRYAHV ASPFRAVLIR
TTAFILFAAS GWALLPLIAR VELGGGPGTY GLLLSFVGIG AVCGILVLPR LHELASRDRL
VLAASLIYGA TIMALAILQS EEMLYAIMTL SGAAWVSVLW SLQVTAQTSV PAWVRGRALS
LYIMVFSAGL ALGSLFWGWV AASTTVPTAL LLSSAGTMVA ALAVRNFSLG SREAPDLAPS
YHWRPHPPAM EEPDLRRGPV LVTVEYEIGL DQRRAFLEAI RSLGASRRRD GAFAWGVFED
LEKPGRYIEF FQQASWLDHL RQHARVTRED QRVQENVNRF HTGSEAPRVS HFIGGTPTAS
TDSPAATGGM TEA