Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3041 |
Symbol | |
ID | 4444305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3407921 |
End bp | 3409210 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639690865 |
Product | extracellular solute-binding protein |
Protein accession | YP_832520 |
Protein GI | 116671587 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.887996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCGCAG CACTGCCCAG CAGGCGTTCC CTGCTCCGGG CCGCCGGACT GGCCGGAGCG GCATTCATGG TTCCCCTTTC CGGCTGTGGC GCCGGGCCCG CTGCACAGGA CGGGGTCACC ACGCTCCGGT TCATGCAGAA CAAACCCGAA GTAGTGGACT ACTTCAACCA GGTGATCAAG GACTTTGAGG CGCTGAACCC GGACATCCGG GTGGTCCAGG ACTTCAACGA GGGCAACTTT GTTCCGGGCC TCGTCCGCAA TGATCCGCCG GATGTGGTGA CCCGCGGTTT CGCGCAGGCC ACCGCCGACT TCGTCAAGAA GGGCATCTTC GCCGACCTCT CGGACATGCC CGTGGCAAAC ACCATCGACC CGAAGATGCA GGAGCTCATC AACTCCTGGG GCCAGTACAA CGGCACCGAG ATCAGTGCCC TGCCGTTCTC GCTTGCGGCA GCCGGCGTCA TCTACAACCG TGACATCTTC GAGGCCCAGG GGGTCCCTGT TCCCACCACC TGGGACGGAT TCGTTGCGGC CTGCGAGAAG TTCAAGGCGG CCGGCATCGC TCCGATCTAC GGAACGTACA AGGACCCCTG GACCCTCGCC CAGGGCATGT TCGATTACGC GTCCGGCGGC ACGCTGGACG TTGCGGAATT CTTCATGAAG CTCAAGGCCA GGGGTGCCGG CATCACCAAG GACTCCGCAG AGTCGTTTGC CAACAACTTC GGCCCTGCCC TCCCCAAGAT GCTCCAGCTT GCCTCGTTCT CGCAGAACGG AGCGGCCAGC AAGAACTACG CGGATGGCAA TGCCGCGTTC GCCAAGGGGC AGGCAGCCAT GTATCTGCAG GGCCCGTGGG CGCTATCGCA GCTCGTGGCA GCCAACAAGA ACATCAGGCT GGGAACGTTC CCGCTGCCGG TGACGAACAA CCCGGCGGAG ACCAAGGCGC GCGTGAACGT GGATATGGCG CTCTCCATCA CCCGCAACAC GCCCAACATG GCTGCGGCTC GGCGGTTCGT CAACTACCTG CTGGAACCGT CCGTGGTGAA CACGTACAAC GAGAAGAATG CTGCGTTCTC GCCGCTCAAG GACGCACCCG CCGTCAGCAA TCCGCAGATC ACCGGGTTGA ATGCTGCGGT CCAAGAGGCC CGCTACTTCC AGGGCGCCGT TACCTACTTC CCGCCGTCGG TCCCCGTAAA CAACTACATC CAGTCCTTCG TGTACGGCAA GAACGGCGAC CAGTTCCTTT CCGCGCTCGA CGACGAATGG CGCCGGGTCG CCGAGCGTAC CGCGGTCTAA
|
Protein sequence | MPAALPSRRS LLRAAGLAGA AFMVPLSGCG AGPAAQDGVT TLRFMQNKPE VVDYFNQVIK DFEALNPDIR VVQDFNEGNF VPGLVRNDPP DVVTRGFAQA TADFVKKGIF ADLSDMPVAN TIDPKMQELI NSWGQYNGTE ISALPFSLAA AGVIYNRDIF EAQGVPVPTT WDGFVAACEK FKAAGIAPIY GTYKDPWTLA QGMFDYASGG TLDVAEFFMK LKARGAGITK DSAESFANNF GPALPKMLQL ASFSQNGAAS KNYADGNAAF AKGQAAMYLQ GPWALSQLVA ANKNIRLGTF PLPVTNNPAE TKARVNVDMA LSITRNTPNM AAARRFVNYL LEPSVVNTYN EKNAAFSPLK DAPAVSNPQI TGLNAAVQEA RYFQGAVTYF PPSVPVNNYI QSFVYGKNGD QFLSALDDEW RRVAERTAV
|
| |