Gene Information |
Locus tag | Arth_0291 |
Symbol | |
ID | 4447225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 305165 |
End bp | 306757 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639688087 |
Product | general substrate transporter |
Protein accession | YP_829792 |
Protein GI | 116668859 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAAAG ACCGAAGCAT GACCGACTCT TCAGCAGGCG TGAACAACGC CCGCGGAACC GACAATATTT TGCCTGAGGG CGTTGCGCCC CAGGGCGCCA AGAAGCCGAG GCTGCTGCCT CGCCGCAGGC TCAAAGTCTC CGACGTCAAC GTCGTCAACA AGCCGATGCT CAAAAAAGCA CTCGGCGGAA CCATCGTGGG TAACACCATG GAGTGGTACG ACGTCGGCGT GTTCGGCTAC CTGATCACCA CCATGGGTCC GGTGTTCCTG CCGGAGTCCG ACCCGTCCAC GCAGACGCTG TTCCTGCTGG GCACGTTTGC CGCCACATTC ATCGCCCGTC CTCTTGGCGG CGTGATCTTC GGCTGGTTCG GTGACAAGGT TGGCCGCCAG AAGGTCCTGG CAGCAACCCT GATGCTGATG GCGGCCAGTA CGTTCGCCAT CGGCCTTCTC CCCGGCTACG CCCAGATCGG TTTGTGGGCG GCCGGATTGC TGGTGCTGCT GAAAATCGTG CAGGGCTTCT CCACCGGCGG CGAGTACGCC GGAGCCACCA CCTTCGTGAG CGAGTACGCT CCGGACAAGC GCCGCGGCTT CTTCGCCAGC TTCCTTGACC TGGGAAGCTA CCTCGGCTTT GCAATCGGTG CCGCCCTCGT TTCAGCCCTG CAGCTGACCA TGGGCCAGGC TGCGATGGAA GAGTGGGGCT GGCGCATCCC GTTCCTGCTC GCCGGTCCCC TGGGCCTCAT CGCGGTCTAC TTCCGGAGCA AGATCGAGGA ATCCCCGCAG TTCCAGGCCA CCCTGGACGC GCAGGAAGAA CTCAGCAAGG ACGCTGCCAA GTCCTCCGAC GCTGCTTCCA AGAGCCCGGT GGGCGTTGTC AAGGCCAACT GGCGGCCCAT TATTGTGGCC ATGATCCTTG TGGCTGCGGC CAACACCGCC GGCTACGCGC TGACCTCCTA CATGCCGACG TACCTCACGG ATGCCAAGGG TTACGACCCT GTCCACGGCA CGCTGCTGAC CATCCCGGTG TTGGTCGTCA TGAGCCTGTG CATTCCGCTG ACTGGAAAGC TTTCGGACCG CATCGGACGC CGCCCGGTCC TGTGGATCGG TGCCGTGAGC ACCATCGTGC TGGCCACCCC CGCCTTCCTG CTCATTGGCG TTGGCGAGAT CTGGTCGACC CTGGCCGGCC TGGCACTGAT CGCCTTCCCC GTCACGTTCT ATGTGGCCAA CCTGGCCTCG GCCCTGCCCG CGCAGTTCCC GACGGCCAGC CGGTACAGCG CCATGGGTAT CGCCTACAAC TTCTCGGTAG CGATTTTCGG CGGCACCACG CCTTTCATCG TGGCGGCGCT GATCAAGGCG ACCGGCAACG ACATGATGCC CGCGTACTAC CTGATGGCTA CATCAGCCGT TGGCGCAGTG GCCATCTACT TCCTGAAGGA ATCCGCCAAC CGTCCGCTGC CCGGCTCCAT GCCTAGCGTG GACACCCAGG CGGAGGCCCA CGAGCTGGTG GCCACCCAGG ACGAGAACCC CCTGATCGAC CTGGACGACA TGCCGTTTGA GGATGAGCTG CGGGAAACCG AAAAGGTTCC TGCGAGGGCC TGA
|
Protein sequence | MPKDRSMTDS SAGVNNARGT DNILPEGVAP QGAKKPRLLP RRRLKVSDVN VVNKPMLKKA LGGTIVGNTM EWYDVGVFGY LITTMGPVFL PESDPSTQTL FLLGTFAATF IARPLGGVIF GWFGDKVGRQ KVLAATLMLM AASTFAIGLL PGYAQIGLWA AGLLVLLKIV QGFSTGGEYA GATTFVSEYA PDKRRGFFAS FLDLGSYLGF AIGAALVSAL QLTMGQAAME EWGWRIPFLL AGPLGLIAVY FRSKIEESPQ FQATLDAQEE LSKDAAKSSD AASKSPVGVV KANWRPIIVA MILVAAANTA GYALTSYMPT YLTDAKGYDP VHGTLLTIPV LVVMSLCIPL TGKLSDRIGR RPVLWIGAVS TIVLATPAFL LIGVGEIWST LAGLALIAFP VTFYVANLAS ALPAQFPTAS RYSAMGIAYN FSVAIFGGTT PFIVAALIKA TGNDMMPAYY LMATSAVGAV AIYFLKESAN RPLPGSMPSV DTQAEAHELV ATQDENPLID LDDMPFEDEL RETEKVPARA
|
| |
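
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: the helper names `gene_length_bp` and `protein_length_aa` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the sequence above ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the stop codon, if present,
    does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Arth_0291.
length = gene_length_bp(305165, 306757)
print(length)                     # 1593, matching "Gene Length"
print(protein_length_aa(length))  # 530, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields. The strand field does not affect the arithmetic, since length depends only on the span of the coordinates.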