Gene Arth_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0414 
Symbol 
ID4447109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp443519 
End bp444556 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content63% 
IMG OID639688213 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_829915 
Protein GI116668982 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACCG AATTGGGCCC GACGCCGGTA AAAAAGCCGG CGTCGGGCAG TCCCGAGATC 
CACCACGCAC CCAAGGGGGT GGGGGAGGAC AACCGGATCG CCAGCCAGGG CAGGTGGGCA
TCGTGGCTGT TGGCCCCCAC GATCATCGCG CTGGCGGTTG TGATCGTTTA CCCGATCATC
AGCGCACTCG TTATGTCCTT CCAAAAGGAC GCCGGCCTGG ATCCCGTCAC CGGGCTTTTC
ACGGCGGGCG GCCCGGCAGG CGTCCAGAAC TACGTGAACT GGCTTGCCCA GCAGTGCTCC
GCTCCCGGCG GCGGCACCGT GGCCTGCCCT CCCGGTACAC TGGGCGCCCA GTTCTGGTCC
GCGACGGCCA CCACGTTTTT CTTCACCGTG GTGACCGTGA CCCTGGAAAC CGTCCTCGGT
TTCTGGATGG CCCTCATCAT GGCCAGGACC TTCCGGGGAC GCAGCCTGGT CCGCGCAGCA
GTCCTGGTCC CGTGGGCCAT TCCCACCGCT GTGACTGCCA AGCTGTGGCT GTTCATCTTC
GCTTTTGAGG GCATCGCGAA CAAGCTGTTC AATACCACCA TCCTGTGGAC CGGCAGCGAG
TGGCCGGCCA AGTGGGCAGT TATCATCGCC GACGTCTGGA AGACCACGCC GTTCATGGCC
CTCCTCATCC TCGCCGGCCT CCAGATGATC CCCGCAGAGG TCTATGAGGC CGCCAAGGTT
GACGGTGCCA GCACCTGGCA GCGGTTCCGC CTAATCACCC TGCCGCTGGT CAAGCCGGCG
CTTATGGTGG CCGTCCTGTT CCGTACCCTG GACGCACTTC GCATGTTCGA CCTGCCGTAC
ATCCTGACGG GCGGGGCCAA CAACACCACC ACGCTGTCCA TCTTGGTGAT CAACCAGATC
AGGCAAGGCT TCAACGCGGC GGCAGCATTG TCCACCATTA CGTTCATCAT CATCTTCATC
GTCGCGTTCA TCTTTGTGCG CTTCCTGGGT GCGAACGTCG TGGAACAAAG CGGAACCACC
GGTAAGGGGA AGAAATGA
 
Protein sequence
MATELGPTPV KKPASGSPEI HHAPKGVGED NRIASQGRWA SWLLAPTIIA LAVVIVYPII 
SALVMSFQKD AGLDPVTGLF TAGGPAGVQN YVNWLAQQCS APGGGTVACP PGTLGAQFWS
ATATTFFFTV VTVTLETVLG FWMALIMART FRGRSLVRAA VLVPWAIPTA VTAKLWLFIF
AFEGIANKLF NTTILWTGSE WPAKWAVIIA DVWKTTPFMA LLILAGLQMI PAEVYEAAKV
DGASTWQRFR LITLPLVKPA LMVAVLFRTL DALRMFDLPY ILTGGANNTT TLSILVINQI
RQGFNAAAAL STITFIIIFI VAFIFVRFLG ANVVEQSGTT GKGKK