Gene Namu_3444 details

Gene Information

Locus tag: Namu_3444
Symbol:
ID: 8449059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3787285
End bp: 3789012
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 71%
IMG OID: 645042520
Product: sulfate transporter
Protein accession: YP_003202760
Protein GI: 258653604
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
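
The coordinate and length fields in this record are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the record does not state either, but both assumptions reproduce the reported 1728 bp and 575 aa. The function names are illustrative only.

```python
# Coordinates copied from the record.
# Assumption: positions are 1-based and inclusive.
START_BP = 3787285
END_BP = 3789012


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1728 575
```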


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0027476
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000776053
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGGCGCCG TCACTGCCAT CGAACGCGCC GTCCCCGGCG TCGGGATGTT GCGCACCTAC 
CGGCCGGCCT GGCTGGGCAA GGACGTCACC GCCGGCCTGG TGCTCACCGC GCTGCTGGTG
CCGCAGGGCA TGGCCTACGC CGAACTGGCT GGGCTGCCGC CGATCACCGG CCTATACACG
ACCGTGCTGT GCCTGCTCGG GTACGCGGCG TTCGGCCCGT CCAAGGTGCT GGTGCTCGGG
CCCGATTCCT CGCTGGGGCC GATGATCGCG GCCACCGTCA TCCCGCTGGT CACGGCCAAC
GGCGACCCCG GCAAGGCGGT GGCCTACGCC TCGATGCTGG CCCTGATGGT CGGCGCGATC
ACCATCGCCG CGGGCGCCTT CCGGCTCGGC TTCATCGCCG ACCTGCTGTC CAAGCCGACG
CAGGTCGGGT ACATGAACGG CCTGGCCCTG ACCATCGTCA TCGGGCAGCT CCCCAAGCTG
TTCGGCTTCT CGGTGGATGG GGACGGGCTG ATCGAGGAGG CGACCGAGTT CGTCCGCGGG
GTCGCCGATG GGCGGACCGT GCCGGCCGCG CTGGCCATCG GGGTGGGCTC GCTGGCGGTC
ATCCTGCTGC TCAACCGGTT CCTGCCGCGC ATCCCCGGGG TGCTGGTCGC GGTGGTGCTG
GCGATCGCCG CGGTGGCCGT GTTCGACCTG GCCGCGCGCG GGGTCAAGCT CGTCGGCACG
CTGCCCGAAG GCTTCCCGCC GCTGACCATC CCGACGGTGC CGCTGACCGA TCTGGGGCTG
CTGTTCGCCG GGGCACTGGG CATCGCGCTG GTCTCGCTGA CCGACACCAT CTCCACGGCC
AGCGCGTTCG CCGGCCGGCG CGGCGAGGAC GTCAACGGCA ACCGGGAGAT GATCGGCATC
GGCGCCGCCA ACATCGCGGC CGGCCTGTTC CAGGGGTTCC CGGTGTCCAC CAGCGGCTCG
CGGACCGCGG TGGCCGAGCA GAACGGGGCC CGCTCGCAGG TCACCGGCCT GGTCGGCGCG
GGGGCGGTGA CGCTGATGCT GGTGTTCTTC CCCGGGCTGC TGCGCAACCT GCCGCAGCCC
ACCCTGGCCG CCATCGTCAT CGCCGCGTCG ATCTCACTGG CCGACCTGCC GGCCCTGCGC
CGGCTGTGGC GGCAGCGCAA GTCGGACTTC GCGCTGGCCA TGGCCGCGTT CCTGGGGGTG
GCACTGCTCG GCGTGCTGCC CGGCATCGCG ATCGCCGTGG CCCTGTCGGT GCTCAACGTG
TTCAGCCGGG TCTGGCGTCC CTACCGGACC ATGCTGGGCA AGGTCGAGGA CCTCAAGGGC
TACCACGACA TCCGGCGCTA CCCCGCCGCG GATGCGCTGC CCGGGCTGGT GCTGTACCGG
TTCGACGGGC CGCTCATCTT CGCCAACGCC AACACCTTCC GCGACGACCT GCGCCGGTTC
GCCGAGGCGA CTCCCCCGCC GCGGTGGATC GTGGTGACCG CCGAGCCGAT CACCGACGTG
GACACCACCG CCGCGGACAT GCTGGTCGAG CTGGACCTGT GGCTCAACGC GCGCGGGATC
AACCTGGTGT TCGCCGAGAT GAAGGACCCC GTGAAGACCA AGATCGAGCG CTACGAGCTG
ACCGACACGA TCGACCCGAA CCACTTCTTC CCGACGATCG GGTCGGCCGT GCGCGCGTAC
CGGGACATAA CCGGCCTGGA CTGGCCGGAC CGCGATCTGC CCGACTGA
 
Protein sequence
MGAVTAIERA VPGVGMLRTY RPAWLGKDVT AGLVLTALLV PQGMAYAELA GLPPITGLYT 
TVLCLLGYAA FGPSKVLVLG PDSSLGPMIA ATVIPLVTAN GDPGKAVAYA SMLALMVGAI
TIAAGAFRLG FIADLLSKPT QVGYMNGLAL TIVIGQLPKL FGFSVDGDGL IEEATEFVRG
VADGRTVPAA LAIGVGSLAV ILLLNRFLPR IPGVLVAVVL AIAAVAVFDL AARGVKLVGT
LPEGFPPLTI PTVPLTDLGL LFAGALGIAL VSLTDTISTA SAFAGRRGED VNGNREMIGI
GAANIAAGLF QGFPVSTSGS RTAVAEQNGA RSQVTGLVGA GAVTLMLVFF PGLLRNLPQP
TLAAIVIAAS ISLADLPALR RLWRQRKSDF ALAMAAFLGV ALLGVLPGIA IAVALSVLNV
FSRVWRPYRT MLGKVEDLKG YHDIRRYPAA DALPGLVLYR FDGPLIFANA NTFRDDLRRF
AEATPPPRWI VVTAEPITDV DTTAADMLVE LDLWLNARGI NLVFAEMKDP VKTKIERYEL
TDTIDPNHFF PTIGSAVRAY RDITGLDWPD RDLPD
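
The sequences above are laid out in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the blocks might be joined, how a GC fraction could be computed, and how the first and last codons can be inspected. Only the first and last lines of the gene sequence are used here, not the full gene. Under translation table 11, GTG is an accepted alternative start codon that is read as methionine at initiation, which is consistent with the protein sequence beginning with M.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGGGCGCCG TCACTGCCAT CGAACGCGCC GTCCCCGGCG TCGGGATGTT GCGCACCTAC"
last_line = "CGGGACATAA CCGGCCTGGA CTGGCCGGAC CGCGATCTGC CCGACTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(codons(head)[0])  # GTG (start codon)
print(tail[-3:])        # TGA (stop codon)
print(round(gc_fraction(head), 2))  # GC fraction of the first 60 bases only
```

Running `gc_fraction` on the cleaned full gene sequence, rather than on one line, would be the way to compare against the GC content reported in the record.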