Gene Arth_3029 details


Gene Information

Locus tag: Arth_3029
Symbol:
ID: 4444396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3395328
End bp: 3396977
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 67%
IMG OID: 639690853
Product: sulphate transporter
Protein accession: YP_832508
Protein GI: 116671575
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
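
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3395328
END_BP = 3396977
GENE_LENGTH_BP = 1650
PROTEIN_LENGTH_AA = 549


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3396977 - 3395328 + 1 gives 1650 bp, and 1650 / 3 gives 550 codons, that is, 549 residues plus the stop codon.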


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGACGC TGCGGGCATT CCTGCCGTCC CGCCGCGATT ACGACGGGCT GCGCTCCTCG 
TGGAAGACGG ACCTCACTGC GGGCATCACG GTGGGGATCG TTGCACTTCC CCTGGCGCTG
GCGTTCGGCG TCAGCTCCGG AGTGGGCGCC GAAGCCGGAC TGATCACCGC TGTGGTTGCC
GGCCTGGTGG CGGCCATCAT GGGCGGCTCC AACGTCCAGG TCTCCGGCCC CACCGGCGCC
ATGGTGGTGG TCCTCGCCCC TGTGGTGGCC AGCCACGGCG CCGGCAGCAT TCCGATTGTT
TCGCTGCTGG CAGGGCTCAT TGTCTGCGCA CTCGGCATCA GCGGACTGGG CCGCGCCGTT
GCTTTCATCC CCTGGCCTGT GGTGGAAGGA TTTACCCTCG GCATTGCCGC CATCATCTTC
CTCCAGCAGG TGCCCCTGGC CACTGGAACA GCGGGCATCC CGGGTCACAA CACCCTCGTT
GCGGCGATCG AAGCCGCCTC GGGAGCCGCG TTTCCCACCG TCATCCAGAC CCTTGCCGTA
GTGGTTGCAG TCGCGGCCAT CATGTTTGTG GTACCCAAGG TTCACAAGTC GCTGCCTGCC
AGCCTGACCG CCGTGCTGCT GGTCACCGTG GCCGCGGAGC TGCTCCGCCT CGACATTCCC
CGGATCGGCG CGCTGCCACA TTCCCTGCCC GCCCCTGCCG TGCCCATGAT CGACCCCACA
GCTCTTGGCG ACCTCATGAT GCCGGCACTG TCCATCGCCG CGCTGGCGGC CATTGAGTCA
CTGCTCTCCG CCCGGGTGGC GGCGGGAATG GTGGGCCCGG ACGGCAGGCC CGGCGGCCGC
TACAGCCCGG ACCGCGAACT GACCGGACAG GGGCTGGCGT CCATTGCGGC CGGACTGTTC
GGCGGAATGC CCGCAACCGG CGCCATCGCC CGGTCCGCCG TCAACGTCCG GTCCGGAGCG
AAGACACGGC TGGCGGCCGT GGTCCACGCC CTGGTGTTGT TGGCAATCAT CTACCTCGCG
GCCGGATTGG TCGGACGGAT TCCGCTGGCC GTGCTGGGCG GAGTCCTGAT GGTCACGGCC
ACGCGAATGG TCTCGACCCA GACGGTCAGC GCCATTCTCC GTTCCACCCG GTCTGATGCG
GCAGTCTTCA TTCTCACTGC CCTGATCACC GTGGCTTTCG ACCTGATCAT CGCCATCCAG
ATCGGACTGG CCGCTGCCGC CCTGCTGACG CTGCGGAAGT TTGCCTCACT GAGCGGCGTC
CGCCGGGAGG CCCTAACGGG CGCGCCCGCC GAAGGAGATT CCCACATCGC CATCTTCAGG
CTCGACGGCG CCATGTTCTT CGGGGCGGCG GAACGGATCC TGCAGGAAAT CAACGAGGTA
AAGGACATCC AGGTGGCCAT CATCCGTTTG TCCCAGGTGC GCATGCTGGA CGCCACGGGC
GCACACGCCC TTGTGGAAGT CATCTCGGCG CTGGAACTCC GCGGAATCAC CGTTCTGCTG
AAGGGGGTCC AGCCGGAGCA CCTGGAACTT GTGACCAACG TGGGTGTGAT CCGTTCCCTC
CGGCACCACA AGCACCTGTT CAGCACACTG CCGGCGGCCG TGGACCACGC CCGCAGCCAT
GTGCTGCGGA ATGCCGCTGC TAGCGCTTGA
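
The record reports a GC content of 67% but does not say how the figure was computed. The sketch below uses the common definition (G plus C bases divided by total length) and assumes the sequence is pasted in the 10-base, space-separated layout shown above. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence above; paste the full block
    # to compare the result against the reported GC content.
    first_row = (
        "ATGCGGACGC TGCGGGCATT CCTGCCGTCC "
        "CGCCGCGATT ACGACGGGCT GCGCTCCTCG"
    )
    print(f"{gc_fraction(first_row):.1%}")
```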
 
Protein sequence
MRTLRAFLPS RRDYDGLRSS WKTDLTAGIT VGIVALPLAL AFGVSSGVGA EAGLITAVVA 
GLVAAIMGGS NVQVSGPTGA MVVVLAPVVA SHGAGSIPIV SLLAGLIVCA LGISGLGRAV
AFIPWPVVEG FTLGIAAIIF LQQVPLATGT AGIPGHNTLV AAIEAASGAA FPTVIQTLAV
VVAVAAIMFV VPKVHKSLPA SLTAVLLVTV AAELLRLDIP RIGALPHSLP APAVPMIDPT
ALGDLMMPAL SIAALAAIES LLSARVAAGM VGPDGRPGGR YSPDRELTGQ GLASIAAGLF
GGMPATGAIA RSAVNVRSGA KTRLAAVVHA LVLLAIIYLA AGLVGRIPLA VLGGVLMVTA
TRMVSTQTVS AILRSTRSDA AVFILTALIT VAFDLIIAIQ IGLAAAALLT LRKFASLSGV
RREALTGAPA EGDSHIAIFR LDGAMFFGAA ERILQEINEV KDIQVAIIRL SQVRMLDATG
AHALVEVISA LELRGITVLL KGVQPEHLEL VTNVGVIRSL RHHKHLFSTL PAAVDHARSH
VLRNAAASA
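
The protein sequence can be compared with a translation of the gene sequence. The following is a rough sketch, not the database's own method: it builds the codon-to-residue mapping used by NCBI translation table 11, which shares its amino acid assignments with the standard code, and translates from the first base until the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, and alternative start codons and ambiguous bases are not handled.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First and last rows of the gene sequence listed above.
    print(translate("ATGCGGACGC TGCGGGCATT CCTGCCGTCC "
                    "CGCCGCGATT ACGACGGGCT GCGCTCCTCG"))
    print(translate("GTGCTGCGGA ATGCCGCTGC TAGCGCTTGA"))
```

The first gene row translates to the first 20 residues of the protein sequence, and the last row (30 bases, so still in frame) translates to its final nine residues followed by the TGA stop.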