Gene Arth_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2378 
Symbol 
ID4444992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2665099 
End bp2666454 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID639690186 
Productextracellular solute-binding protein 
Protein accessionYP_831857 
Protein GI116670924 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.612639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCGCC CTGCCAAAAT ACTTGCGGCG CTGATGTCGG CGGCAGCGCT GCTGGCCACA 
TCAGCCTGCT CCGCGGAGAA ACCGGCCGCC GAAGACCGGA CCCTGAAGAT CGTCTACCAG
AAGACTGACT CGTTTTCTGC CCTGGACACC TTGTTCAAGG ACGCCAAGAA GGACTTCGAA
GCTGCCAACC AGGGCACCAA AGTGGAGCTG CAGCCCATCG AGGCCAACGA TGACGACTAC
GGCACCAAGC TGGCGCTGGC CCTCCGGTCG TCCGAGACCG CCCCGGACGT CTTTTACGAG
GACACCTTCA AAGTGAGGTC CGATGTCGAC GCCGGATACC TCCTGAAACT GGACGGGTAT
CTCGAAAAAT GGGACGACTG GAAGTTGTAC AACGAGGCCG CCAAAGCTGC CGGCACAGGA
GACGACGGCG GGATCTACGC TGTGCCCCTG GGAACCGACA CCCGCGCCAT CTGGTACAAC
AAGAAAGTCC TGCAAAAAGC AGGCATATCG GTTCCCTGGC AGCCCCGGAG CTGGGACGAA
ATACTCGAAG CCGCCCGCAA GATGAAAGCG GCGGACCCGT CCCTGGTCCC CTTCAACATG
TATGCCGGCA AGGCCACCGG TGAAGGAACC GTCATGCAGA GTTTCTACGA ACTGCTGTAC
GGCACGGACA GCGAACTCTA TGACCAGCAG GAGAAAAAAT GGGTGATCGG TTCCCGGGGG
TTCACCGATT CCCTGGCTTT CCTGAAGACA CTCTACGACG AAGGACTTGC CGTCACGCCT
GCCGAGGCGC TTGACGCCAA CGTCTGGAAG AAGGTCTTCG GCGAATGGCT GCCCAAGGGC
AAGATGGGCG CCACCGTGGA AGGTTCCTAC ACGCCGTCGT TCTGGCAGAA GGGCGGCAAC
TACGAATGGG CAGGCTATGC CGAGGACATG GGGGTGGCGA AATTCCCCAC CCAGCGTGGC
CAGGAACCCG GCGGCGTCAG CATGTCCGGT GGCTGGACGC TGGCCGTCGG AGCCGACTCC
AAAAACCCGG ACCTGGCGTT CAAGTTCCTT TCCGAGGCCG TGAGCAAGAA GAACTCGCTG
GCGTTCACCG TGTCCGGATC CCAGATCGCG GTCCGGACGG ACGTCGCCGC CGAAGCCGAG
TACCTGGCGG CAAATCCGTT CGTCAAAGAC GTCTCCGAAC TCGTATCCGT CACCCACTAC
CGGCCCGCCA CGGCGGACTA TCCGCGGATC TCCGCCGCGG TCCAGGAGGC AACCGAAGCC
GTGATCACCG GTGCCCTCTC ACCGCAGGAG GCAGCCGCGC AGTACGACAA GACAGTCAGG
GACCAGGTGG GTGACGCCAA GGTCCTGCAG AAATAG
 
Protein sequence
MHRPAKILAA LMSAAALLAT SACSAEKPAA EDRTLKIVYQ KTDSFSALDT LFKDAKKDFE 
AANQGTKVEL QPIEANDDDY GTKLALALRS SETAPDVFYE DTFKVRSDVD AGYLLKLDGY
LEKWDDWKLY NEAAKAAGTG DDGGIYAVPL GTDTRAIWYN KKVLQKAGIS VPWQPRSWDE
ILEAARKMKA ADPSLVPFNM YAGKATGEGT VMQSFYELLY GTDSELYDQQ EKKWVIGSRG
FTDSLAFLKT LYDEGLAVTP AEALDANVWK KVFGEWLPKG KMGATVEGSY TPSFWQKGGN
YEWAGYAEDM GVAKFPTQRG QEPGGVSMSG GWTLAVGADS KNPDLAFKFL SEAVSKKNSL
AFTVSGSQIA VRTDVAAEAE YLAANPFVKD VSELVSVTHY RPATADYPRI SAAVQEATEA
VITGALSPQE AAAQYDKTVR DQVGDAKVLQ K