Gene Nmul_A0966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0966 
Symbol 
ID3785757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1122503 
End bp1123741 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content58% 
IMG OID637811049 
Productmajor facilitator transporter 
Protein accessionYP_411661 
Protein GI82702095 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.582345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCTAAAT CACTATTTGC AGGCATCTGG TTCAGGATTG TCCTGGTATT TGCCATGGCC 
CTGCCGATGC TTGTCTTGTA TGCCATCAGC ACGCTTGGAC CATTCCTCGT GCGCGATCTG
CGCTTCGAGC CCGGACTGCC GGGTTATCTC GTGATGAGTG CATTCGGCAT CGCGGCCATT
CTATCGCTCT GGTCAGGCGC GTTCGTCGAC CGGATCGGGA CACGCCGAGC GCTGGCAGTG
CTCTTCTTCG CCGTAGCGCT TGCTTTTGCG CTGATTGCAA CGGTTGAAGA TTATTACTCC
CTGGTTGCAG CGGCTGCCAT TTGCGGAATC GCCCAGGCGC TGGCAAACCC GGTCACCAAT
TTACTGATCG CGCAGCATGT TTCACCGGAG AAAAAGGCGA CAGTCGTAGG ATTCAAGCAA
TCCGGCGTTC AGCTTGCGGC GCTTTTTGCA GGTCTCATTC TTCCCGCCAT CGCAACGCAA
TACGGGTGGC ATGCCGCATT CGGCATCGTT GTTCCGGTAG CGATACTGTT CGGAATTACG
ACACCCTTCA TCGCTCCGCG GAAACCCTCG GAAACGCGTA AAGCTTTCAC TGTCCCCCTG
CCAAATGTAC TGCTGCTGCG TTTGATGGCC ATCCAGTTTT GCGTGGGTAT AGCACTCTCC
GCCTTCGTTA CGTTCCTGCC GACCTTTGCC GTCCGCCAAG GGATGGCGCT GTCGGTGGCG
GGAAGCTTGA TCGCCGTCTT CGGCGTGATG GGGATACTCT CGCGGATAAC ATTGACACCC
CTCGGTGCCA GAATGAAAGA CGAATCGCTG CTTCTGATTG TGCTCATCGC CATTGCAGCT
TGCGCAATCG CGGTGACGAT GAGGGCCAAT GCGGACAGTC ATTGGCCCCT CTGGGCAGGA
GCTGTGGGAA TGGGCCTCAC CGCCGTCGGC ACCAATGCAA TCGCAATGAG CATGTTGATC
AGGGATGCCA CATTTGGCCC GATAGCAACG GCATCCGGTT TTGTTTCAGT CGCTTTCTTC
AGCGGGTTCG CATCCGGCCC GCCCCTTTAT AGCGAGTTTT CGAATTATTC CGGCAACTCC
CAGTCTTCCT GGGGCTTGTT GATCGGCGTG CTCTTGTGCG GGTGCCTGAC GGCCCTGGGA
CTGGCTTCTG CCCGACGGCG CAAGGCGCAA ACACCGGCGC CAACTGGCGC GGTTCAGCGC
GCGACTACCG CAAAGGTGCA AGAACCTGTG AAGCCATGA
 
Protein sequence
MAKSLFAGIW FRIVLVFAMA LPMLVLYAIS TLGPFLVRDL RFEPGLPGYL VMSAFGIAAI 
LSLWSGAFVD RIGTRRALAV LFFAVALAFA LIATVEDYYS LVAAAAICGI AQALANPVTN
LLIAQHVSPE KKATVVGFKQ SGVQLAALFA GLILPAIATQ YGWHAAFGIV VPVAILFGIT
TPFIAPRKPS ETRKAFTVPL PNVLLLRLMA IQFCVGIALS AFVTFLPTFA VRQGMALSVA
GSLIAVFGVM GILSRITLTP LGARMKDESL LLIVLIAIAA CAIAVTMRAN ADSHWPLWAG
AVGMGLTAVG TNAIAMSMLI RDATFGPIAT ASGFVSVAFF SGFASGPPLY SEFSNYSGNS
QSSWGLLIGV LLCGCLTALG LASARRRKAQ TPAPTGAVQR ATTAKVQEPV KP