Gene Arth_1745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1745 
Symbol 
ID4445714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1952695 
End bp1953588 
Gene Length894 bp 
Protein Length297 aa 
Translation table11 
GC content63% 
IMG OID639689565 
Productextracellular solute-binding protein 
Protein accessionYP_831237 
Protein GI116670304 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR02995] ectoine/hydroxyectoine ABC transporter solute-binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00227146 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAGTC ACATTTCACG GCGGAACCTC CTGCGCGGCG CCGGTGCGGC AGCCTTGGGA 
ATCTCGGTGG CAGGCTGGGT AACCAGCTGT TCCACCGTTC CCGTCGGCGG CCCTGCAACA
GGGGCCACCA CCAACCTGCT GGACACTGCG AAGGAGCAGG GCTTCATCCG GGTTGGCATC
GCCAATGAGC CGCCGTACAC CCAGGTCAGC CCGGACGGGA AAGTCACCGG TTGTGAGCCT
GATGTCCTTC GCGCGGTCTG CAAGCGCCTG GGCATCGACG AGGTCCAGGG CATCATCACG
CCGTACGAGT CCATGATTCC GGGGCTCAAT GCCAACCGCT GGGATGTCAT TGCGGCGGGC
CTCTTTATGA AGCAGTCGCG GTGTTCCCAG GTTCTCTACT CGGAGCCGGT TATCGTTTCC
ACCGAGTCCT TCGCCATGCC GAAGGGCAAC CCGAAGGGCA TCCTGACGGT CGCTGACATC
ATTGCCAACC CCGCGCTGCG CATTGCCGTC CTGCCGGGCG GGTTCGAGGA AGGGGTCCTG
AAGGCGGCCA AAGTTCCCGC CAGCCAGCAG GTCAAGGTCA ATGACGGCCG CAGCGGCCTT
GAGGCGCTCA CGGCAAACCG GGCGGACGCC TTCATGCTCC CCACCCTGTC CCTTAAGTCA
CTTGCAGAGA ATGACGGCAG CTTCGATATC ACAGCACCGA TCAAAGACGC TCCCCGCACG
GGCTCGGGTG CTGCTTTCCG CAAGGCTGAC ACGTCCTTCC ACGAGGCTTA CAACAGGGAG
CTTGCCGCGT TCAAGGCCAC TCCTGAGTTT GGCGCGATCC TCACCAAGTG GGGCTTCGAT
CCGACCGTAG TCGAAGGGGC CACTGCGGAG GAACTATGCA AGACCGAGGG CTGA
 
Protein sequence
MSSHISRRNL LRGAGAAALG ISVAGWVTSC STVPVGGPAT GATTNLLDTA KEQGFIRVGI 
ANEPPYTQVS PDGKVTGCEP DVLRAVCKRL GIDEVQGIIT PYESMIPGLN ANRWDVIAAG
LFMKQSRCSQ VLYSEPVIVS TESFAMPKGN PKGILTVADI IANPALRIAV LPGGFEEGVL
KAAKVPASQQ VKVNDGRSGL EALTANRADA FMLPTLSLKS LAENDGSFDI TAPIKDAPRT
GSGAAFRKAD TSFHEAYNRE LAAFKATPEF GAILTKWGFD PTVVEGATAE ELCKTEG