Gene Arth_3326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3326 
Symbol 
ID4443968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3735402 
End bp3736763 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content64% 
IMG OID639691149 
Productextracellular solute-binding protein 
Protein accessionYP_832801 
Protein GI116671868 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000398294 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCGCTAT TTTCCCGCCC TGCCTCAGCC GCCAGTCGTG GCGTAGCAGA GGCCACATCC 
GCCCGGGCCG GGCGGAGGCT TCGCCGAACC GGCGCAGTCG CCGCAGCTGC CGCCGTCGTA
CTTGCCCTCA GTGCCTGCGG CGGGGGAGCC GCCCCGCAAA GTGCCGATGG CAAGGTTGAA
CTCCGCTTCT CCTGGTGGGG AGGAGACAAG CGGGCGCAAC TGACGCAGGC CGCGATCGCG
GCATTCGAGG CTGAGAACCC GAACATCAAG ATCAAGCCGG AGTTCGGCGA CTGGAGCGGT
TACTGGGACA AGCTCGCCAC GCAGGTTGCT GCCAACGACG CCCCGGACAT CATCCAGATG
GACGAAAAAT ACATCACGGA GTACTCCAGC CGCGGCGCCC TGCTGGACCT TTCCAAGTAC
GACATTGACA CGTCAAAGTT TGACGAAGCC GCCCTCAACG CCGGGAAGAG CGAGGACGGC
CTGACGGGGA TTGCCGCCGG CATCAACGCT GCAACCATCC TGGCCAACCC GGCAGTCTTC
AAGGCCGCAG GCGTTGCGCT GCCGGACGAC AAGACCTGGA CCTGGGAGGA CTTCGAGCGC
ATCGCTGCCG AGGTCACTGC GAAGTCGCCA AAGGGCACCT ACGGCGCTGC CGCCTACGGC
ACCGATGAAG CCTCGCTCGG CGTATGGCTG CGGCAGAACG GCAAGTCGCT GTACACCAGC
GACGGCAAGC TGGGCTTCGA GCCGGGCGAC ATCGCCGAAT GGTGGGCGTT CCTGAAGGAA
CTCAGCGAGA AGAAGGCCGT GCCCTCAGCC TCGGAGGTGG TTGAGGCCGA GGCGGCACCG
CTGGACCAGA GCGGCCTGGC GACAGGCAAG AACGGGCTCG CGTTCTGGTG GTCCAACCAG
CTGCCGGCGC TGGAGAAGGC TGCCGGCGGA GAACTTCAGA TCCTGCGGTT CCCGTCCAAG
ACCGGCAGCT CCGCGGACGC CAAGCTTTGG TACAAGGCCT CGCAGTTCTG GTCAGCTTCT
TCACGCACCA AGCATCCGGA AGAAACCGCG AAATTCATCA ACTTCCTGGC CAACAACACC
AAGGCCGGCG AAACCCTCCT GGCCGACCGC GGCGTTTATC CCAACTCCGA TGTCCGGGCG
GCAATCGCAC CCAAGCTGAC CCCCGCCGAC ATCAAGGTGG TCAAGTTCAT TGACCAGATC
AAGGGCGAAC TTGGCGAGGC TCCGGCACCG CCGCCGAAGG GCGCGGGTGC CATCCAGGAA
ATCGTCAAGC GCTACACCTC GGAGGTTCTC TTCAACCGGC TGTCCACGGA GGAAGCCGGC
AAGAAGGCAG TCGATGAAAT GAAATCAGCC ATCAGCAGCT AG
 
Protein sequence
MPLFSRPASA ASRGVAEATS ARAGRRLRRT GAVAAAAAVV LALSACGGGA APQSADGKVE 
LRFSWWGGDK RAQLTQAAIA AFEAENPNIK IKPEFGDWSG YWDKLATQVA ANDAPDIIQM
DEKYITEYSS RGALLDLSKY DIDTSKFDEA ALNAGKSEDG LTGIAAGINA ATILANPAVF
KAAGVALPDD KTWTWEDFER IAAEVTAKSP KGTYGAAAYG TDEASLGVWL RQNGKSLYTS
DGKLGFEPGD IAEWWAFLKE LSEKKAVPSA SEVVEAEAAP LDQSGLATGK NGLAFWWSNQ
LPALEKAAGG ELQILRFPSK TGSSADAKLW YKASQFWSAS SRTKHPEETA KFINFLANNT
KAGETLLADR GVYPNSDVRA AIAPKLTPAD IKVVKFIDQI KGELGEAPAP PPKGAGAIQE
IVKRYTSEVL FNRLSTEEAG KKAVDEMKSA ISS