Gene Arth_3329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3329 
Symbol 
ID4444058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3739225 
End bp3740520 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content62% 
IMG OID639691152 
Productextracellular solute-binding protein 
Protein accessionYP_832804 
Protein GI116671871 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0332045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTTGG CTCTCAAGAA GTCTGTAATG GGTGTCGCCG GCGCCACGGC TGCGCTGGCG 
TTGGTACTGA CGGGCTGCGG CAACAGCCCG CAGGCCGGGA AGGTGGGAAC GGCGGAGGAC
CCGGTGACCA TCAGGTTCGC ATGGTGGGGC AATGACTCCC GCGCCAAGAC CACCCTGGAA
GTGATCAAGG ACTTCGAGGC TGCCAACCCC ACCATCAAGG TGCAGGGTGA GAACACCGAG
TTCAGCTCCT ACTGGGACAA GATGGCAACC CAGATCGCCG GCGGAACGAC GCCGGACGTG
TTCGCCATGA GCGGCGCCTA CCCCAGCGAA TACGCCAGCC GCGGTGTGCT CCTGGACTTG
GACAAGGTCA AAGACCAGAT CGATACCTCC AAGTTCGCCG AGGGAACGGT GGACCTGGGC
AAGATCGACG GCAAGCAGTA CACCATCACG GCAGGCGTGA ACTCGATGTC CATGGTCATC
GACCCCCAGG TCTTCGAGGC TGCCGGGGTG CCGCTGCCGA ACGACGAAAC CTGGACTTGG
GACGACTACG TGGACATTGC CGCGGAGATC GCGAAGAAGT CGCCGGCGGG CACGTTCGGC
ACCACGCCCA TGGCCAATGA TTCCTTCCTG GCCGTCTGGG CACGCCAGAA CAACGAGGCC
CTCTACACGG ATGACGGCAA GAAGATGGGA ATCAGCGAAG GCACCCTGAC CCGCTGGTTT
GAACTGAACA AGAAGCTCAT GGACACCGGC GGCGCCCCGT CGGCCTCCCA GACCGTGGAG
GACGGCTCGG CCCAGCCCGA ACTGACCCTC ATGGGCCAAG GCAAACAGGC CATGAAGATT
TCGTGGAGCA ACCAGATGAC GTCCTATTCC GGTTTCCCGC TGGTCATGAT GAAGATGCCC
GGTGAAAGCA AGCAGCCCGG AGCCTGGCTG CGTTCCTCCA TGGAATATGC CATTTCCTCG
AAATCCGCGC AGTCGAAGGA AGCCGCACTG TTCATCAACT ACCTGGTCAA CAACATGGAC
GCCGCCACCA AGATCAAGAG CGACCGTGGC ATGCCCGCCA ACACGGACTT GAAGGCCGGT
ATCACCCCGT TGCTGAAGGA AGGCCAGCAG AAGGAGGCCG CCTACCTGGA CCGGATCGCG
GAGCTGAATG TGGCCCCGCC CAAGCCGTTC CCGGCCGGTT CTTCCGCCAC CCTTGAGGTG
CTGAATCGCT ACAACACGGA CGTTTTGTTC GGGAAGATAT CCCCGCAGGA CGCTGCGAAG
GGCTTCATTT CCGAGGTCAA CCAGAACCTG GGCTGA
 
Protein sequence
MRLALKKSVM GVAGATAALA LVLTGCGNSP QAGKVGTAED PVTIRFAWWG NDSRAKTTLE 
VIKDFEAANP TIKVQGENTE FSSYWDKMAT QIAGGTTPDV FAMSGAYPSE YASRGVLLDL
DKVKDQIDTS KFAEGTVDLG KIDGKQYTIT AGVNSMSMVI DPQVFEAAGV PLPNDETWTW
DDYVDIAAEI AKKSPAGTFG TTPMANDSFL AVWARQNNEA LYTDDGKKMG ISEGTLTRWF
ELNKKLMDTG GAPSASQTVE DGSAQPELTL MGQGKQAMKI SWSNQMTSYS GFPLVMMKMP
GESKQPGAWL RSSMEYAISS KSAQSKEAAL FINYLVNNMD AATKIKSDRG MPANTDLKAG
ITPLLKEGQQ KEAAYLDRIA ELNVAPPKPF PAGSSATLEV LNRYNTDVLF GKISPQDAAK
GFISEVNQNL G