Gene Arth_0090 details


Gene Information

Locus tag: Arth_0090
Symbol:
ID: 4447458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 92050
End bp: 93375
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 60%
IMG OID: 639687885
Product: extracellular solute-binding protein
Protein accession: YP_829591
Protein GI: 116668658
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
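
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(92050, 93375)
print(length, protein_length(length))  # 1326 441
```

Under these assumptions the record is internally consistent: 1326 bp corresponds to 442 codons, one of which is the stop codon.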


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.214251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAAC CCCAGTCCTC CCGGCGCTCC TTTCTTGCCC TTGCCGCCCT TGCACCTTTC 
GCCGCCATGG TGACCAGTGC CTGCGGGACA TCGGGGCCGG GCGCCTCCAG CGGCGGGGGT
GCCAGCATGT GGTACCTCTC CGGTGAGCCG AACCAGACCA CCATGCAGAA GGCGGTGGAT
GCGTTCGGTT CAGCGAACCC GGACAACAAG GTCACAGTGA CCTACTTCCA GAACGACGCA
TACAAAACCA AGATCAAAAC GGCGATCGGC GCCGGCCAGG CGCCGACCAT CATCTATGGC
TGGGGCGGCG GGACCCTGAA GACCTATGCC GAGGCTAAGC AGGTTGAAGA CCTGACCAGC
TGGTTGGCGG AGAACCCGGA CCTGAAGGAC AAATTCTTCC CCGCATCCTT CGGCGCCGCC
ACCGTGAACG GTAAGGTCTA TGCGCTGCCC AACCAGTACG TGGCCCCGAT CGTGCTGTTC
TACAACAAGG AACTGTTCGA AAAGGCCGGC GCCCAGCCGC CAAAGACCTG GGACGACATC
ATGTCCCTCG TCAAGACCTT CAACAACATG GGTGTGGCGC CCTTCTCCCT TGGTGGACAG
TCCCGGTGGA CCTCCATGAT GTGGCTGGAG TACCTGCTCG ACCGCATCGG CGGAGCCGAA
GTCTTCACAG CCATTTTCGA AGGCAAGCCC GATGCATGGA AAGATCCAGC CGTGATCGAG
ACGGGCACCA AGATCCAGGA GCTCGTCTCG GCGGATGGCT TCATCAAGGG CTTCTCATCC
ATCGCTGCTG ACTCCAATGC TGACCAGGCC CTTCTGTTCA CGGGCAAAGC AGCCATGATG
CTCCACGGTT CCTGGACCTA CGGCGCTATG AAGAAAGGCG GCCAGAACTT CGTCCAGGAC
GGCAAGCTCG GCTTCGTCCA ATTCCCGGTC GTTGCCGGCG GCAAGGGCGA TCCAAAGAAC
GGCGTGGGAA ACCCCGCCCA GTACATGTCC ATTTCCTCAA AGGCCTCTGA AAAGGAAAAA
GAGACGGCGA AGAAGTTCTT CAAGGACGGC ATCCTGACCG ACACGGTCAT AGACACCTAC
ATCAATTCCG GGTCCGTTCC CATTGTCAAC GGCATCGAGG ACAAGCTCAA CACGTCTCCG
GACAAGGACT TCCTGAACTT CGTCTACGAC CTGGCCAAGA ACGCACCGAA CTTCCAGCAG
TCGTGGGACC AGGCACTCAG CCCGACCGCC GCCGAAGCCC TGCTGAACAA CATCGACCAG
TTGTTCCTCA AGTCGATCAC GCCTCAGCAG TTCGCCGAGA ACATGAATGC CACCCTCGGA
AAATGA
 
Protein sequence
MKQPQSSRRS FLALAALAPF AAMVTSACGT SGPGASSGGG ASMWYLSGEP NQTTMQKAVD 
AFGSANPDNK VTVTYFQNDA YKTKIKTAIG AGQAPTIIYG WGGGTLKTYA EAKQVEDLTS
WLAENPDLKD KFFPASFGAA TVNGKVYALP NQYVAPIVLF YNKELFEKAG AQPPKTWDDI
MSLVKTFNNM GVAPFSLGGQ SRWTSMMWLE YLLDRIGGAE VFTAIFEGKP DAWKDPAVIE
TGTKIQELVS ADGFIKGFSS IAADSNADQA LLFTGKAAMM LHGSWTYGAM KKGGQNFVQD
GKLGFVQFPV VAGGKGDPKN GVGNPAQYMS ISSKASEKEK ETAKKFFKDG ILTDTVIDTY
INSGSVPIVN GIEDKLNTSP DKDFLNFVYD LAKNAPNFQQ SWDQALSPTA AEALLNNIDQ
LFLKSITPQQ FAENMNATLG K
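
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below, in Python, shows one way to strip the blocks, compute GC content, and translate the gene sequence. It assumes the standard codon assignments (which translation table 11 shares with table 1 for internal codons) and, to stay short, uses only the first three blocks of the gene sequence as input. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI's compact layout (base order T, C, A, G).
# Table 11 differs from table 1 only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAGCAAC CCCAGTCCTC CCGGCGCTCC"
print(translate(first_blocks))  # MKQPQSSRRS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the reported GC content and protein sequence to be checked in the same way.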