Gene Arth_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3041 
Symbol 
ID4444305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3407921 
End bp3409210 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content63% 
IMG OID639690865 
Productextracellular solute-binding protein 
Protein accessionYP_832520 
Protein GI116671587 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.887996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCGCAG CACTGCCCAG CAGGCGTTCC CTGCTCCGGG CCGCCGGACT GGCCGGAGCG 
GCATTCATGG TTCCCCTTTC CGGCTGTGGC GCCGGGCCCG CTGCACAGGA CGGGGTCACC
ACGCTCCGGT TCATGCAGAA CAAACCCGAA GTAGTGGACT ACTTCAACCA GGTGATCAAG
GACTTTGAGG CGCTGAACCC GGACATCCGG GTGGTCCAGG ACTTCAACGA GGGCAACTTT
GTTCCGGGCC TCGTCCGCAA TGATCCGCCG GATGTGGTGA CCCGCGGTTT CGCGCAGGCC
ACCGCCGACT TCGTCAAGAA GGGCATCTTC GCCGACCTCT CGGACATGCC CGTGGCAAAC
ACCATCGACC CGAAGATGCA GGAGCTCATC AACTCCTGGG GCCAGTACAA CGGCACCGAG
ATCAGTGCCC TGCCGTTCTC GCTTGCGGCA GCCGGCGTCA TCTACAACCG TGACATCTTC
GAGGCCCAGG GGGTCCCTGT TCCCACCACC TGGGACGGAT TCGTTGCGGC CTGCGAGAAG
TTCAAGGCGG CCGGCATCGC TCCGATCTAC GGAACGTACA AGGACCCCTG GACCCTCGCC
CAGGGCATGT TCGATTACGC GTCCGGCGGC ACGCTGGACG TTGCGGAATT CTTCATGAAG
CTCAAGGCCA GGGGTGCCGG CATCACCAAG GACTCCGCAG AGTCGTTTGC CAACAACTTC
GGCCCTGCCC TCCCCAAGAT GCTCCAGCTT GCCTCGTTCT CGCAGAACGG AGCGGCCAGC
AAGAACTACG CGGATGGCAA TGCCGCGTTC GCCAAGGGGC AGGCAGCCAT GTATCTGCAG
GGCCCGTGGG CGCTATCGCA GCTCGTGGCA GCCAACAAGA ACATCAGGCT GGGAACGTTC
CCGCTGCCGG TGACGAACAA CCCGGCGGAG ACCAAGGCGC GCGTGAACGT GGATATGGCG
CTCTCCATCA CCCGCAACAC GCCCAACATG GCTGCGGCTC GGCGGTTCGT CAACTACCTG
CTGGAACCGT CCGTGGTGAA CACGTACAAC GAGAAGAATG CTGCGTTCTC GCCGCTCAAG
GACGCACCCG CCGTCAGCAA TCCGCAGATC ACCGGGTTGA ATGCTGCGGT CCAAGAGGCC
CGCTACTTCC AGGGCGCCGT TACCTACTTC CCGCCGTCGG TCCCCGTAAA CAACTACATC
CAGTCCTTCG TGTACGGCAA GAACGGCGAC CAGTTCCTTT CCGCGCTCGA CGACGAATGG
CGCCGGGTCG CCGAGCGTAC CGCGGTCTAA
 
Protein sequence
MPAALPSRRS LLRAAGLAGA AFMVPLSGCG AGPAAQDGVT TLRFMQNKPE VVDYFNQVIK 
DFEALNPDIR VVQDFNEGNF VPGLVRNDPP DVVTRGFAQA TADFVKKGIF ADLSDMPVAN
TIDPKMQELI NSWGQYNGTE ISALPFSLAA AGVIYNRDIF EAQGVPVPTT WDGFVAACEK
FKAAGIAPIY GTYKDPWTLA QGMFDYASGG TLDVAEFFMK LKARGAGITK DSAESFANNF
GPALPKMLQL ASFSQNGAAS KNYADGNAAF AKGQAAMYLQ GPWALSQLVA ANKNIRLGTF
PLPVTNNPAE TKARVNVDMA LSITRNTPNM AAARRFVNYL LEPSVVNTYN EKNAAFSPLK
DAPAVSNPQI TGLNAAVQEA RYFQGAVTYF PPSVPVNNYI QSFVYGKNGD QFLSALDDEW
RRVAERTAV