Gene Arth_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3901 
Symbol 
ID4444541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4394521 
End bp4395501 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID639691726 
Productextracellular solute-binding protein 
Protein accessionYP_833376 
Protein GI116672443 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGTT TTTCACTCAA ACCCGGCGTC ACCGCAGCGC TGGCAGCCAC CACTTTGCTG 
GTGCTTGCGG CCTGCTCCGA CCCCGGCGCC GCAGCGGCCC CGGCGTCCGG TCCGGCGACG
TCCGCCGCAG GCGTCAAGCA GTTCAACCTC TCCCCCGAGC AGAACCGGGA GGCAGCCGCC
GTTGACCCGG CAGCCGCGGC TCTGGTGCCG GAAGCCATCA AGAAAGACGG CAAGCTCACG
GTGGCCGTCA GCCCCTTCGC AGCGCCGTTG GCCGTCTACG CCACGGACAA CAAGACTCCG
GTGGGCAACG AGGTGGATAT CGCCGTCGCA CTGGCCCAGA CGCTGGGGCT GGAGGCTGAC
ATCGTTCCCA CCGCCTGGGC GGACTGGCCC CTCGGCGTCG AGTCCGGCAA GTACGAGGCC
GTGCTATCCA ATGTCACGGT GACCGAGGAG CGGAAACTCA AGTTCGATTT CGCCAGCTAC
CGGGACGACA AACTGGGTTT CTACACCAAG AGCGACAGCT CCATCAGCAA GGTGGAATCC
GCTCCGGATG TTGCCGGGAA GCGCGTGATT GTGGGTTCGG GGACCAACCA GGAGTCCATC
CTGCTGCGCT GGGATGAGGA GAACAAGAAG AACGGCCTGC CAGCGGTGGA GTTCCAGTAT
TACGACGACG ACTCCGCCTC CTCGCTCGCC CTCCAGTCCG GACGCGCGGA CCTGACGTTC
GGGCCGAACG CTACGGCCGC CTTCAAGGCG GCGTCCGACG CAAAGACCAA GCTGGTAGGC
CTGGTGGACG GCGGCTGGCC ACTGAAGGCC AGCATCGCGG CCACCACCAA GAAGGGCAAC
GGCTTTGCCG CTGCCGCCCA GGCCGGCCTG AACCACCTCA TCGAAGACGG CAGCTATGCC
AAAATCCTGG ACCGCTGGGG CCTTAGCGCG GAAGCCGTCC CGAAGTCCGA ACTGAACCCG
GCGGGACTGC CGAAGAAGTA G
 
Protein sequence
MARFSLKPGV TAALAATTLL VLAACSDPGA AAAPASGPAT SAAGVKQFNL SPEQNREAAA 
VDPAAAALVP EAIKKDGKLT VAVSPFAAPL AVYATDNKTP VGNEVDIAVA LAQTLGLEAD
IVPTAWADWP LGVESGKYEA VLSNVTVTEE RKLKFDFASY RDDKLGFYTK SDSSISKVES
APDVAGKRVI VGSGTNQESI LLRWDEENKK NGLPAVEFQY YDDDSASSLA LQSGRADLTF
GPNATAAFKA ASDAKTKLVG LVDGGWPLKA SIAATTKKGN GFAAAAQAGL NHLIEDGSYA
KILDRWGLSA EAVPKSELNP AGLPKK