Gene Arth_0291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0291 
Symbol 
ID4447225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp305165 
End bp306757 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content64% 
IMG OID639688087 
Productgeneral substrate transporter 
Protein accessionYP_829792 
Protein GI116668859 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAAAG ACCGAAGCAT GACCGACTCT TCAGCAGGCG TGAACAACGC CCGCGGAACC 
GACAATATTT TGCCTGAGGG CGTTGCGCCC CAGGGCGCCA AGAAGCCGAG GCTGCTGCCT
CGCCGCAGGC TCAAAGTCTC CGACGTCAAC GTCGTCAACA AGCCGATGCT CAAAAAAGCA
CTCGGCGGAA CCATCGTGGG TAACACCATG GAGTGGTACG ACGTCGGCGT GTTCGGCTAC
CTGATCACCA CCATGGGTCC GGTGTTCCTG CCGGAGTCCG ACCCGTCCAC GCAGACGCTG
TTCCTGCTGG GCACGTTTGC CGCCACATTC ATCGCCCGTC CTCTTGGCGG CGTGATCTTC
GGCTGGTTCG GTGACAAGGT TGGCCGCCAG AAGGTCCTGG CAGCAACCCT GATGCTGATG
GCGGCCAGTA CGTTCGCCAT CGGCCTTCTC CCCGGCTACG CCCAGATCGG TTTGTGGGCG
GCCGGATTGC TGGTGCTGCT GAAAATCGTG CAGGGCTTCT CCACCGGCGG CGAGTACGCC
GGAGCCACCA CCTTCGTGAG CGAGTACGCT CCGGACAAGC GCCGCGGCTT CTTCGCCAGC
TTCCTTGACC TGGGAAGCTA CCTCGGCTTT GCAATCGGTG CCGCCCTCGT TTCAGCCCTG
CAGCTGACCA TGGGCCAGGC TGCGATGGAA GAGTGGGGCT GGCGCATCCC GTTCCTGCTC
GCCGGTCCCC TGGGCCTCAT CGCGGTCTAC TTCCGGAGCA AGATCGAGGA ATCCCCGCAG
TTCCAGGCCA CCCTGGACGC GCAGGAAGAA CTCAGCAAGG ACGCTGCCAA GTCCTCCGAC
GCTGCTTCCA AGAGCCCGGT GGGCGTTGTC AAGGCCAACT GGCGGCCCAT TATTGTGGCC
ATGATCCTTG TGGCTGCGGC CAACACCGCC GGCTACGCGC TGACCTCCTA CATGCCGACG
TACCTCACGG ATGCCAAGGG TTACGACCCT GTCCACGGCA CGCTGCTGAC CATCCCGGTG
TTGGTCGTCA TGAGCCTGTG CATTCCGCTG ACTGGAAAGC TTTCGGACCG CATCGGACGC
CGCCCGGTCC TGTGGATCGG TGCCGTGAGC ACCATCGTGC TGGCCACCCC CGCCTTCCTG
CTCATTGGCG TTGGCGAGAT CTGGTCGACC CTGGCCGGCC TGGCACTGAT CGCCTTCCCC
GTCACGTTCT ATGTGGCCAA CCTGGCCTCG GCCCTGCCCG CGCAGTTCCC GACGGCCAGC
CGGTACAGCG CCATGGGTAT CGCCTACAAC TTCTCGGTAG CGATTTTCGG CGGCACCACG
CCTTTCATCG TGGCGGCGCT GATCAAGGCG ACCGGCAACG ACATGATGCC CGCGTACTAC
CTGATGGCTA CATCAGCCGT TGGCGCAGTG GCCATCTACT TCCTGAAGGA ATCCGCCAAC
CGTCCGCTGC CCGGCTCCAT GCCTAGCGTG GACACCCAGG CGGAGGCCCA CGAGCTGGTG
GCCACCCAGG ACGAGAACCC CCTGATCGAC CTGGACGACA TGCCGTTTGA GGATGAGCTG
CGGGAAACCG AAAAGGTTCC TGCGAGGGCC TGA
 
Protein sequence
MPKDRSMTDS SAGVNNARGT DNILPEGVAP QGAKKPRLLP RRRLKVSDVN VVNKPMLKKA 
LGGTIVGNTM EWYDVGVFGY LITTMGPVFL PESDPSTQTL FLLGTFAATF IARPLGGVIF
GWFGDKVGRQ KVLAATLMLM AASTFAIGLL PGYAQIGLWA AGLLVLLKIV QGFSTGGEYA
GATTFVSEYA PDKRRGFFAS FLDLGSYLGF AIGAALVSAL QLTMGQAAME EWGWRIPFLL
AGPLGLIAVY FRSKIEESPQ FQATLDAQEE LSKDAAKSSD AASKSPVGVV KANWRPIIVA
MILVAAANTA GYALTSYMPT YLTDAKGYDP VHGTLLTIPV LVVMSLCIPL TGKLSDRIGR
RPVLWIGAVS TIVLATPAFL LIGVGEIWST LAGLALIAFP VTFYVANLAS ALPAQFPTAS
RYSAMGIAYN FSVAIFGGTT PFIVAALIKA TGNDMMPAYY LMATSAVGAV AIYFLKESAN
RPLPGSMPSV DTQAEAHELV ATQDENPLID LDDMPFEDEL RETEKVPARA