Gene Arth_3387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3387 
Symbol 
ID4444116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3808714 
End bp3810057 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content61% 
IMG OID639691210 
Productextracellular solute-binding protein 
Protein accessionYP_832862 
Protein GI116671929 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.278202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAT CCCTCGGCAC CGCCGCCGTC GCCGCAGCCA TCGCGCTCTC CCTCTCCGCC 
TGTGGGGGCG GCTCAGGCTC CTCTGCAGAA TCGGCCAAGG GCGAGCTCAG CTACTGGCTC
TGGGACGCCA ACCAGCTTCC CGCCTACCAG CAGTGCGCTG ATGACTTCCA GAAGGCCAAC
CCGGACATCA AGGTCAAGAT CACCCAGCGC GGCTGGGACG ATTACTGGAG CACGCTCACG
AACGGGTTCG TTGGCGGCAC GGCTCCCGAC GTCTTCACCA ACCACCTGGG CCGCTACGGC
GAGCTCGCCG CGAACAAGCA GCTGCTGCCC ATTGACGACG CCGTCAAGAA GGACAACGTG
GACCTGTCCG CCTACAACGA GGGACTCGCG GACCTCTGGG TGGGCCAGGA CGGCAAGCGC
TACGGCCTGC CCAAGGACTG GGACACCATC GGGTTGTTCT ACAACAAGGC CATGCTTTCC
AAGGCCGGCG TCTCCGAAGA AGAGATGAAG AACCTCACCT GGAACCCGCA GGACGGCGGA
ACGTACGAGA AGATCATTGC CCACCTGACC GTTGACAAGA ACGGCAAGCG CGGGGACGAA
GCGGGCTTCG ACAAGAACAA TGTGGATGTC TACGGCCTTG GACTCAACGG CGGCGGCGAC
TCCTCAGGCC AGACTGAGTG GAGCTACCTC ACCAACACCA CCGGCTGGTC ACACACGGAC
AAGAACCCGT GGGGAACTCA CTACAACTAT GACGACCCCA AATTCCAGTC CTCTATCGAC
TGGTTCGCAG GGCTGGTTGA CAAGGGCTAC ATGCCCAAGC TTGAAACCAC TGTTGGCGCA
GCCATGGCCG ACACCTTCGC CGCGGGCAAG TCTGCCATCA ACGCCCACGG CTCATGGATG
ATCGGCCAGT ACACCGGGTA CAAGGGTGTT GAGGTGGGCA TCGCTCCCAC CCCCGTGGGT
CCTGAAGGCA AGCGGGCGTC GATGTTCAAC GGCCTGGCCG ACTCGATCTG GGCCGGCACC
AAGAAGAAGG ACGCCGCCAT CAAGTGGGTG GAGTACCTTG CCTCAGCACC TTGCCAGGAC
GTCGTCGCAT CCAAGGCTGT GGTGTTCCCG GCCCTGAAAG CCTCTTCCGA AAAAGCGGCG
GAAGCATTCA AGGCCAAGGG TGTGGATGTC ACCGCCTTCA CCGAGCACGT CAAGAACGGA
ACCACATTCC TGTACCCCAT CACTGACAAT ACTGCCAAGG TCAAGGGCAT CATGGAGCCT
GCCATGGACG CTGTAGTATC CGGCAAAAAG CCTGCCAGCT CCTTGACCGA AGCCAACAAC
CAGGTGAACG ATCTCTTCAA GTAG
 
Protein sequence
MKKSLGTAAV AAAIALSLSA CGGGSGSSAE SAKGELSYWL WDANQLPAYQ QCADDFQKAN 
PDIKVKITQR GWDDYWSTLT NGFVGGTAPD VFTNHLGRYG ELAANKQLLP IDDAVKKDNV
DLSAYNEGLA DLWVGQDGKR YGLPKDWDTI GLFYNKAMLS KAGVSEEEMK NLTWNPQDGG
TYEKIIAHLT VDKNGKRGDE AGFDKNNVDV YGLGLNGGGD SSGQTEWSYL TNTTGWSHTD
KNPWGTHYNY DDPKFQSSID WFAGLVDKGY MPKLETTVGA AMADTFAAGK SAINAHGSWM
IGQYTGYKGV EVGIAPTPVG PEGKRASMFN GLADSIWAGT KKKDAAIKWV EYLASAPCQD
VVASKAVVFP ALKASSEKAA EAFKAKGVDV TAFTEHVKNG TTFLYPITDN TAKVKGIMEP
AMDAVVSGKK PASSLTEANN QVNDLFK