Gene Arth_0104 details

Gene Information

Locus tag: Arth_0104
Symbol:
ID: 4447437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 105694
End bp: 107196
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 65%
IMG OID: 639687899
Product: sulphate transporter
Protein accession: YP_829605
Protein GI: 116668672
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
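
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed values) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 105694
END_BP = 107196
GENE_LENGTH_BP = 1503
PROTEIN_LENGTH_AA = 500


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


# Both listed lengths agree with the listed coordinates.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```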


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCCG AACAACTCCA ATCCGTCCGG GCTACGCTCC GTTCCCCGCG CCGGCTCAAG 
ACGGAAGCCC TCGCCGGACT GGTGGTGGCG CTGGCGCTCA TCCCGGAGGC GATTGCGTTC
TCCGTCATTG CAGGCGTTGA TCCCCGGATC GGGCTGTTCG CGTCCTTCAC CATGGCCGTC
ACGATTTCCT TCGTGGGCGG CCGGCCCGCC ATGATCTCCG CGGCCACCGG CGCGGTGGCC
CTGGTGATCG CACCGCTGAT GCGGAGCCAC GGCCTGGACT ACCTGATCGC CGCAGTGATC
CTGGCGGGGG CCTTCCAGAT CCTCCTGGCG CTCCTGGGCG TCACCAGGCT CATGCGCTTC
ATTCCGCGGT CAGTGATGGT GGGGTTCGTT AACGCGCTGG CCATCCTGGT GTTCATGGCT
CAGCTGCCCG AGCTGATCAA CGTGCCGTGG CTGGTCTACC CGCTCGTGGC CGTTGGCCTG
GTGATCGTGA TTGGACTCCC GCGGCTCACG TCAGCAGTAC CGTCCCCGCT CGTGGCGATC
GTGGCCCTGA CGCTGTTTGC GGTGCTCGCC AAGATCGACG TTCCCGCCGT CCAGGACAAG
GGCCAGCTTC CGGAAAGCCT GCCCACCCTT TTCATTCCCA ACGTGCCGCT GACGCTGGAA
ACCTTCCAGA TCCTCGCCCC CTTCGCGCTG TCCATGGCGC TTGTGGGCCT GCTCGAATCC
CTGATGACGG CGAAGCTTGT CGACGACATC ACGGACACCC GCTCGAACAA GACCCGCGAG
TCCTGGGGCC AGGGCGTGGC GAACATCGTC ACCGGCTTCC TGGGCGGCAT GGGCGGCTGC
GCCGTGATCG GCCAGACCAT GATCAACGTC AAGGGATCCG GCGCCCGGAG CAGGGTCTCT
ACGTTCCTGG CCGGCGTCTT CCTGCTGGTC CTGGTGGTGG CGCTGGGCGA CGTTGTGGGC
CTGATACCCA TGGCAGCGCT CGTGGCCGTG ATGATCTTCG TCTCCGCCAT CACGTTCGAC
TGGCACTCCA TCGCCCTGAA GACGCTCAGG CGGATGCCCA AATCCGAAAC AGCCGTCATG
TTGATCACGG TGGGCACCGT GGTGGCCACC CACAACCTGG CCATCGGAGT AGGCGTCGGC
GTCCTGGCGG CCATGGCCAT GTTTGCCCGG CGGGTGGCGC ATTTCGCCAC GGTTGAACGG
ACGGAGATCG AGCTCAATGG CGAGACCGTG GCAACGTACA CCGTGGACGG AGAGCTCTTC
TTCGCCTCCT CCAACGACCT CTACACCCAG TTCGAGTACG CCCGCGATGC CGCACCCACG
GTGGACCGCG TCATCATCGA TTTGCATGCC TCGCACCTGT GGGACGCATC CACGATCGCC
GTCCTGGACG CTGTCACCGA GAAGTACCGC AGGCACGGCC GTGAAGTGGA GCTGATCGGC
CTGAACTCCG CGAGCACCCA GATGCGTGAG CGGCTCGCCG GAAAGCTCAA CGCCGGGCAC
TGA
 
Protein sequence
MKPEQLQSVR ATLRSPRRLK TEALAGLVVA LALIPEAIAF SVIAGVDPRI GLFASFTMAV 
TISFVGGRPA MISAATGAVA LVIAPLMRSH GLDYLIAAVI LAGAFQILLA LLGVTRLMRF
IPRSVMVGFV NALAILVFMA QLPELINVPW LVYPLVAVGL VIVIGLPRLT SAVPSPLVAI
VALTLFAVLA KIDVPAVQDK GQLPESLPTL FIPNVPLTLE TFQILAPFAL SMALVGLLES
LMTAKLVDDI TDTRSNKTRE SWGQGVANIV TGFLGGMGGC AVIGQTMINV KGSGARSRVS
TFLAGVFLLV LVVALGDVVG LIPMAALVAV MIFVSAITFD WHSIALKTLR RMPKSETAVM
LITVGTVVAT HNLAIGVGVG VLAAMAMFAR RVAHFATVER TEIELNGETV ATYTVDGELF
FASSNDLYTQ FEYARDAAPT VDRVIIDLHA SHLWDASTIA VLDAVTEKYR RHGREVELIG
LNSASTQMRE RLAGKLNAGH
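
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline. It uses the codon assignments of NCBI translation table 11, which for individual codons are the same as the standard code, and it does not model alternative start codons (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAGCCCG AACAACTCCA ATCCGTCCGG"))  # -> MKPEQLQSVR
```

Passing the full gene sequence to `translate` should yield the protein sequence listed in this record, with the trailing TGA codon ending translation.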