Gene Arth_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0402 
Symbol 
ID4447129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp428500 
End bp429624 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content67% 
IMG OID639688201 
ProductABC transporter related 
Protein accessionYP_829903 
Protein GI116668970 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0245246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCC CGGTATCAGC ACGGCCGGCA CCGGCACCTA AGAGCCAGGA CCGGGGGCAC 
GGCGACACTC CCGCCCTGGC GGTGAAAGGC CTGGTCAAGG ATTTCCACAG CGGCGGGCTG
TTCTCACGGG CCTCGGTGCG CGCCCTCGGC GGCGTGGACC TTGCCATCAG GAAAAGTGAG
ATCGTGGCGC TGGTGGGCGA GTCCGGATCC GGCAAGAGCA CGCTGGCCCG CTGCATCGCA
CGGCTGGAGA AGCCCACCGC CGGCCAGATC CTGCTTAATG GCACGGACGT GCTAAAAAGG
GACCGTTTCC AGGCCTCAAG GGAATACCGT TCGCAGCTGC AGATGGTGTT CCAGGACCCC
TTCGGCTCAC TCAACCCCGT CCACCGGATC GAGCACTTCC TCACCCGCTC GCTCACCCTG
CACGGCAAGG CCGGGACCCC GGCGCAGCTG CGCACCCGCC TGGATGGGCT GATGACCACT
GTGGGCCTCA CTCCGGACAT GCTCAACTCC TACCCGCATG AACTTTCCGG CGGGCAACGG
CAGCGCGTGG CCATCGCCCG GGCGCTCGCG GTGGAACCAG AAGTGATCCT CGCGGACGAA
CCCACGTCGA TGCTGGACGT TTCCGTGCGG ATCGGCATCC TCAACCTGAT GCGCCAGTTG
CGGGACAAGC AGGGGATCTC CATGCTCTAC ATCACCCACG ACCTCGCCTC CGCACGCTAC
CTGGCGGACC GGATCGCCGT GATGTTCGCC GGGGAGCTGG TTGAGGAAGG TGAATCGCTG
GACCTGCTGG CCAACCCGGG CCACCCGTAC ACCCGGCTGC TGGTCTCGGC GGTGCCGGAT
CCCGCCCGGA CCGGATCCTA CGATCCCCGC GAACGGGCGG CACTGCGCGC AGCGGTGATG
GAGTCGGCGT CGTGCGCGTT CGACGGCGAC CCGGAGCAGC GCTGTTCCGC CACCGAACCC
GTCCGGCACC GTGTGGGCGA TCCCGCAAAT GAGCACTGGG TGCGCTGCCA CCTTTACCGG
CCGCCGGCCA CTGCGGCAAG CCACGCCCTG TCCGCCGAGC CCCTCGAAAC ACCGGAGACG
CCGCCAACGG ACGGATCCCG CACAGAAAAC AAGGCTTCCT CATGA
 
Protein sequence
MSTPVSARPA PAPKSQDRGH GDTPALAVKG LVKDFHSGGL FSRASVRALG GVDLAIRKSE 
IVALVGESGS GKSTLARCIA RLEKPTAGQI LLNGTDVLKR DRFQASREYR SQLQMVFQDP
FGSLNPVHRI EHFLTRSLTL HGKAGTPAQL RTRLDGLMTT VGLTPDMLNS YPHELSGGQR
QRVAIARALA VEPEVILADE PTSMLDVSVR IGILNLMRQL RDKQGISMLY ITHDLASARY
LADRIAVMFA GELVEEGESL DLLANPGHPY TRLLVSAVPD PARTGSYDPR ERAALRAAVM
ESASCAFDGD PEQRCSATEP VRHRVGDPAN EHWVRCHLYR PPATAASHAL SAEPLETPET
PPTDGSRTEN KASS