Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3324 |
Symbol | |
ID | 4443966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3731763 |
End bp | 3733046 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639691147 |
Product | extracellular solute-binding protein |
Protein accession | YP_832799 |
Protein GI | 116671866 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000420771 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGCA GAAGGAATTT CCTCACCACC GTGGCGCTGG GCACCGCATC TGCCGCTGCC CTGGCTGCAT GCGGACAGTC CGCTCCCGCA CAGACCGGAT CGGCGCAGAA TCCTGTCACC ATCAATTACA CCTGGTGGGG CAACGATGAC CGTGCCGCCC GCACCCGCAA GGCCATCGAA CTGTTCGAGT CGAAGAACCC CGACATCAAG GTCAACGGCA ACTTCACCGA CTTTGCCGGC TACTGGCAGA AGCGCGCCAC CGAAGCTGCC GGCGGCGGCC TGCCTGACGT GATGCAGTGG GACCTCTCCT ACCTGCGGGA CTATGGCCAG CGGAACCAGC TCCTGGACCT CGGCACCGTA AAGATTGACA CTTCGGGCTT CGACAAGTCG CTGCTGCCGT CCGGCCAGAT CAAGGGCAAG ACCTACGGCA TCCCCACCAG CACCAACGCT TTCGCCGTCT ACTATGATCC TGCCAAGCTC AAGACCCTCG GCGTTTCCGA GCCCGACGGC AGCTGGACCT ATGACGAATA CAAGGCCTGG CTCAAGGAGG TCGGCGCCAA GGGCAACGGC ACCATCTTCG GTTCCACGGA CTTCACCGGC ATCTGGTGGC AGTTCAACAT CTGGCTCCGC CAGAAGAACA TTGAAGCCTT CACCGAGGAC GGCAAGCTCG GCTTCAGCAA GGACGACCTG AAGACCTGGT GGAACATGGC TTCGGACCTG CGCGGTACCC CTGCCCTGGT CTCCGAGGAA CGGGTCACCC AGCTGGCACC CAAGTCGCCT TTTGGCTCCA AGGTGACCGT TGCGGAAAAC ACCTGGGACA ACTTCATGGC CGGCTACCTC GCGGACAGTG GCGCCAAGGA ACTGAAGATT GTCCCCGTTC CCTCCGACGA CCCGGACAAT CTGGGCCTGT TCCTCAAGCC CTCGATGCTC ATGGTGGCCA GCGCCAAGAC AAAGTTCAAG GATGCCTCCG CACGCTTCAT CGACTTCATG ATCAACGACC CCGAGGTGGG CCAGATCTTC AAGACCTCCC GCGGCGTTCC CGCCTCGAAG AAACAACGCG ACGGCACCAC CTTCGAGGGC ACCGACAAGA TCGTGGTGGA CTACGAGAAG TCCATCGAAA AGTACCTCAA GGATGCACCC GAGCCGCCCA TCGTCGGTTT CGGCACCCTG GAAGCCTCCT TCAAGCGCAT CAGCTCGGAC CTCAACTTCG GCAAGCTGAC CGTTGACTCG GCGGCCGACG CCTGGTTCAA GGAAGCCGAA GACCTCATCA AGCAGAACGC CTGA
|
Protein sequence | MISRRNFLTT VALGTASAAA LAACGQSAPA QTGSAQNPVT INYTWWGNDD RAARTRKAIE LFESKNPDIK VNGNFTDFAG YWQKRATEAA GGGLPDVMQW DLSYLRDYGQ RNQLLDLGTV KIDTSGFDKS LLPSGQIKGK TYGIPTSTNA FAVYYDPAKL KTLGVSEPDG SWTYDEYKAW LKEVGAKGNG TIFGSTDFTG IWWQFNIWLR QKNIEAFTED GKLGFSKDDL KTWWNMASDL RGTPALVSEE RVTQLAPKSP FGSKVTVAEN TWDNFMAGYL ADSGAKELKI VPVPSDDPDN LGLFLKPSML MVASAKTKFK DASARFIDFM INDPEVGQIF KTSRGVPASK KQRDGTTFEG TDKIVVDYEK SIEKYLKDAP EPPIVGFGTL EASFKRISSD LNFGKLTVDS AADAWFKEAE DLIKQNA
|
| |