Gene Namu_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2454 
Symbol 
ID8448065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2707291 
End bp2709054 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content70% 
IMG OID645041568 
Productsulfate transporter 
Protein accessionYP_003201812 
Protein GI258652656 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.000318181 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0196593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGCT GGCGACGCGT CAACTGCCTG ACACCGCTGC TGGGCGGTGC CGGGCAGTTC 
CGCAGCTATC GGCGGAGCTG GCTGTCCCGG GACGTCCTGG CCGGACTGAC CGTCGCGGCC
TACCTGGTCC CGCAGGTGAT GGCCTACGCC GAGGTCGCCG GCCTGCCGCC GGTGGTCGGG
CTGTGGGCGA TCATCGGCCC GCTGGCGATC TACGCGTTCG TCGGTTCCTC GCGCCAGCTC
TCGGTCGGAC CGGAGTCGAC CACCGCTTTG ATGACAGCGG TGGTGCTCAT GCCGTTGGCG
GCCGGTGACC CGGTGCGGTA CGCCTCGCTG GCCGCCACCC TGGCCCTGCT CACCGGCGCC
ATCTGCCTAC TCGGGCGACT GCTGCGGCTG GGGTTCCTGG CCGATCTGCT GTCCAAACCG
GTGCTGATCG GCTACATGGC CGGCGTTGCC GTGATCATGA TCGTGGGCCA GCTGGCGAAG
GTCACCGGGG TGCCGGTCAG CGGTGACGAT CTCCTCGGCG AGATGGCCTC GTTCGTCCGC
GGCATCGACC AGATCCGTTG GCCGACAGTC ATTCTGTCCC TGGCGTTGGT GCTGACCCTG
TTCGCGTTCC AGCACTTCGC CCCCCGCCTA CCCGGTCCGC TGATCACCGT CGTCCTGGCC
GCCGCGGTCG TCGCGCTGTT CTCGTTGGGC GAGCACGGCA TCGACGTGGT GGGCGCGGTC
CCGGTCGGCC TGCCGACCCC GGCGCTGCCC GGCCTGGGCA TGGACAGCCT GTCCGTGCTG
GCCATCCCGG CGGTCGGCGT GGCCTTCGTC GGTTACACCG ACAATGTCCT GACCGCGCGT
GCTTTCGCCC TCAAGCAGAA CCAGTCGATC GACGCCAACC AGGAGTGGTT GGCCCTCGGC
CTGGCCAATG CCTCGTCGTC GGTGTTCCAC GGCTTCCCGG TGAGCTCGAG CGGCAGCCGG
ACCGCGATCG CCGCCGCGGT CGGCGCGCGG ACCCAGCTGT ACTCGCTGGT CGCCATGGTC
GTGGTGCTGG TGACCCTATT GGCCGCCGGT CCCCTGCTGG CCGTGTTCCC TCGGCCGGCG
CTGGGTGCGC TGGTCGTCTA CGCCGCGGTG AAACTGATCG ACGGTCCGGA GTTGCGACGG
ATTGCCGCGT TCCGCAAGAG CGAGTTGGTC ACCACGGTCG CCGTGATCGG CCTGGGCGTC
ATCGTCGGCG TCGTGGTGGC CATCGCCCTG TCGGTCGTCG ACCTGCTCCG CCGGGTGGCT
CGGCCCCATG ACGGGATCCT GGGCTACGTG CCCGGGGTCG CCGGCATGCA CGACGTCGAT
GACTACCCCT CCGCGCAGTG CGTTCCGGGC CTGGTCGTCT ACCGCTACGA CGCCCCGCTG
TGCTTCGCCA ACGCCGAGGA CTTCCGGCAT CGGGCGCTGG CCGCCGTCGA GGTCGGCGGG
AACGGGGCCG GTGGCGAACG GGTCCAGTGG TTCATCATGA ACGCCGAGGC GAACGTGGGG
ATCGACATCA CGGCCGCGGA CACGCTCGGT CAGCTGGCCG CCGAACTGGA CCGGCGCGGC
ATCGTCTTCG CGATGGCTCG GGTCAAGCAG GACCTGCTCG CCGACCTGGA CGCGATCGGC
TTCGTGGGCC AGATCGGCTC GCAGCGCATC TATCCCACCT TGCCGACGGC GGTGGAGGGT
TACCTGACCT GGTACCGGAC CAGCCACGGG CAGCTGCCGC AGGGTGTGCA CCAGTCTCCG
TTGCCGACCG ACCCGCTCGC CTGA
 
Protein sequence
MKGWRRVNCL TPLLGGAGQF RSYRRSWLSR DVLAGLTVAA YLVPQVMAYA EVAGLPPVVG 
LWAIIGPLAI YAFVGSSRQL SVGPESTTAL MTAVVLMPLA AGDPVRYASL AATLALLTGA
ICLLGRLLRL GFLADLLSKP VLIGYMAGVA VIMIVGQLAK VTGVPVSGDD LLGEMASFVR
GIDQIRWPTV ILSLALVLTL FAFQHFAPRL PGPLITVVLA AAVVALFSLG EHGIDVVGAV
PVGLPTPALP GLGMDSLSVL AIPAVGVAFV GYTDNVLTAR AFALKQNQSI DANQEWLALG
LANASSSVFH GFPVSSSGSR TAIAAAVGAR TQLYSLVAMV VVLVTLLAAG PLLAVFPRPA
LGALVVYAAV KLIDGPELRR IAAFRKSELV TTVAVIGLGV IVGVVVAIAL SVVDLLRRVA
RPHDGILGYV PGVAGMHDVD DYPSAQCVPG LVVYRYDAPL CFANAEDFRH RALAAVEVGG
NGAGGERVQW FIMNAEANVG IDITAADTLG QLAAELDRRG IVFAMARVKQ DLLADLDAIG
FVGQIGSQRI YPTLPTAVEG YLTWYRTSHG QLPQGVHQSP LPTDPLA