Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3233 |
Symbol | |
ID | 4444014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3641760 |
End bp | 3644198 |
Gene Length | 2439 bp |
Protein Length | 812 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639691057 |
Product | hypothetical protein |
Protein accession | YP_832709 |
Protein GI | 116671776 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACAG TGCACGGCGG GCCGTCCCGG TTGGACGGCC CGCCGGCCTC TTACGCGCCC ACCACCGCCG GACGCCAGCC GGAAGACTCC GGGCCCGCTG TTGATTTGCC CGACGTTCCC TTCCGGGCCG ACGGCATCGA GCTGATCGGC GAAACACAGG GATCCGGCTA CCGCGAGCCG CCCTCGCTGG TGAGGCGCGC CGACGGGCAG GCCATCCAGC TCACCCGCCT GCTTTACTTG GTACTCGAGG CGATCGACGG CAACCGCAGC GTCGACGAGG TTGCAGAGCA TGCCAGCGCC CGCTTCGGCA GGCTGGTCAG CCCGGACAAC GTCCGCACGT TGATCAGCTC GCAGCTGCTG CCCCTGGGAC TGCTCCGGCT GGCCGACGGT TCGCAGCCGG AGGTCAGGAA AGCCGACCCG CTGCTGGGGA TGCGCTTCCG CTACACCGTC ACCGATCCGG ACCGCACGCG GAAACTGACC GCCCCGTTTG CAGCGCTCTT CAATCCGCTC ATCATCGTGG CGGTGTGCGC AGCGTTCCTC GCTTCCTGCT GGTGGGTGCT GATGGTCAAG GGACTCGGCT CCGCCACGCA CGACGCCTTC GCCAACCCGG CCCTGGTGCT GCTGGTCCTG GCCGTCACCG TTTTGTCCGC CGGCTTCCAC GAGTTCGGCC ATGCCGCCGC CGCACGCCGT GGCGGTGCCA CGCCGGGAGC GATGGGCGCC GGCCTCTACC TGATCTGGCC CGCTTTCTTC ACCGACGTCA CCGACTCCTA CCGGCTGGGC CGCGGCGGCC GGATCCGCAC GGACCTTGGC GGACTGTATT TCAACGCGAT CGTGGCCGTG GCCATCATGG GTGTCTGGTG GGCCACCGGT TTCGACGCGC TGCTGCTGGT GGTGGTCACC CAGATCCTGC AGATGGTCCG GCAGCTCCTC CCCCTGGTCA GATTCGACGG CTACCACATC CTGGCCGACG CCACCGGGGT CCCGGACCTC TTCCAGCGCA TCAAGCCGAC CCTGTTGGGA CTGCTGCCCT GGCGCCGGTC GGACCCGGAG GCCCAGGTGC TCAAGCCTTG GGCCCGGGCC GTGGTGACCA TCTGGGTGCT GGTCACCGTG CCGCTGCTGT TGTTCAGCCT GGCAATGATG GTGATCTCAC TGCCGCGGCT TCTGGGCACG GCGTGGGCCA GCGTGCTCAA ACAACAGTCC CAGCTGACCG ACAGCCTCGC TGCCGGGGAC GTCGCCGGCG CCGCCGTCCG CGCCCTGGCG ATCGCCGCCG TCGCGCTGCC CGTGGTGGGC ATCTTCTACG TCCTGCTGCG CCTGGTCCGT CAGCTGACCA CGGGGCTCTG GCAGAAGACC CGCGGCAAGG CAATCCAGCG CGGGGTCGCG ATGGCCGCCG TCGCTGCTGT GACCGCCGGC CTGGCCTGGG CGTGGTGGCC CGGAGCGGAC ACGTACCGGC CGGTGCAGCC GTACGAACGC GGCACCCTGG CTGACGTTAC GACGGCGGTG TTCCCCACGG CGTCGTCCAC AACGCTTCGG GAAGGACGCG CGGGAAAGAC TGTGGCACTG TGGCCTGCGG GTGCAGCCAA GCCCACGAGG GAACAGCCCC AGCTGAGCAT GGTGATGGTG CCCCGCACGG GTCCAGCCGC CGCCGGCACC CCGGACGCCG GCAGCGGTGC CGCCGCACCG CCGTCGTGGG TGTTCCCGTT CAACCAGCCG GCCGCACCCG AAGAGGGCGA CAACCAGGCG CTGGCGGTCA ACACGCAGGA CGGCTCGGTG GTGTACGACG TCGCCTTCGC GCTCGTCTGG GCCGAGGACG GCGAACCGGT GGACACCACC AACGAGGCCT ACGCCTTCGC CAGCTGCTCC GACTGCGCCG CGGTGGCAGT GGGTTTCCAG GTGGTGCTGA TCGTGGGCCA GGCGGATGTG ATTGTTCCGG AGAACCTGTC CGCAGCCGCG AACTACAACT GCGTCCGGTG CCTCACGTAT GCGCTGGCCA ACCAGCTGGT GCTCACGCTG GACGGACCGC TCAGCGGTGA CGGCATGGCC CGGCTCAACG CGCTGTGGGC CGAGATTGCC GAATTCGGGC GGAACCTGCA GAACGTTCCG CTGTCCGAAA TCCAGGGACG CCTCGAAGGA TTCAAGGAGC AGGTCATGGA GATCGTCCGG AACGACCCCA GCGCCACCAA GGGCGCCGCG ACGTCCGCGA CACCAAGCTC CACGGCTACC GCGACCCCCG GATCCAGCCA GGCCCCCTCA CCCGGAGCCA CGGCGGCGCC AACGGTCCCC GCAGGAGCGA CGACGGCGGA TCCGGCGCCC GCTGCTCCCG CCACCGGAGG TGCGGCAACG GAGACACCAG CTGCGACTGC GGAGCCGACG ATCACGCCGA CCGTGACGCC GACGGAACCG GCACTGGCCA CACCTGGACC CACGTCGAAC GGCGAGTAA
|
Protein sequence | MSTVHGGPSR LDGPPASYAP TTAGRQPEDS GPAVDLPDVP FRADGIELIG ETQGSGYREP PSLVRRADGQ AIQLTRLLYL VLEAIDGNRS VDEVAEHASA RFGRLVSPDN VRTLISSQLL PLGLLRLADG SQPEVRKADP LLGMRFRYTV TDPDRTRKLT APFAALFNPL IIVAVCAAFL ASCWWVLMVK GLGSATHDAF ANPALVLLVL AVTVLSAGFH EFGHAAAARR GGATPGAMGA GLYLIWPAFF TDVTDSYRLG RGGRIRTDLG GLYFNAIVAV AIMGVWWATG FDALLLVVVT QILQMVRQLL PLVRFDGYHI LADATGVPDL FQRIKPTLLG LLPWRRSDPE AQVLKPWARA VVTIWVLVTV PLLLFSLAMM VISLPRLLGT AWASVLKQQS QLTDSLAAGD VAGAAVRALA IAAVALPVVG IFYVLLRLVR QLTTGLWQKT RGKAIQRGVA MAAVAAVTAG LAWAWWPGAD TYRPVQPYER GTLADVTTAV FPTASSTTLR EGRAGKTVAL WPAGAAKPTR EQPQLSMVMV PRTGPAAAGT PDAGSGAAAP PSWVFPFNQP AAPEEGDNQA LAVNTQDGSV VYDVAFALVW AEDGEPVDTT NEAYAFASCS DCAAVAVGFQ VVLIVGQADV IVPENLSAAA NYNCVRCLTY ALANQLVLTL DGPLSGDGMA RLNALWAEIA EFGRNLQNVP LSEIQGRLEG FKEQVMEIVR NDPSATKGAA TSATPSSTAT ATPGSSQAPS PGATAAPTVP AGATTADPAP AAPATGGAAT ETPAATAEPT ITPTVTPTEP ALATPGPTSN GE
|
| |