Gene Information |
Locus tag | Arth_0947 |
Symbol | |
ID | 4446540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1017595 |
End bp | 1018893 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639688753 |
Product | hypothetical protein |
Protein accession | YP_830444 |
Protein GI | 116669511 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
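The coordinate and length fields above agree with each other under two assumptions that this page does not state: that Start bp and End bp are 1-based inclusive positions, and that Gene Length counts the terminal stop codon. A minimal Python sketch of that consistency check follows; the function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1017595, 1018893)  # values from the record above
    print(length, protein_length_aa(length))
```

The strand field does not affect this arithmetic: on the minus strand the same interval is read as its reverse complement, but its length is unchanged.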
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAGCG GCACGTTGGT CAACTTCGCC TTGGTTCTCT TCTTCGTACT GCTGGGCGGC GTCTTTGCAG CCACCGAAAT GGCGCTCATT TCCCTCCGGG AAAGCCAGGT GCGCATGATC GAGAAGGCCG GCAAACGCGG CGCCCGGGCC GCTGCGCTGG CCCGCAACCC CAACCGGTTC CTCTCCACCG TGCAGATCGG CGTGACGCTC TCCGGCTTCT TCTCAGCCGC CTACGGCGCG TCCACCATTT CACCCGACAT CGAACCCATC CTGAAAGGCG CGGGGTTCGG CGCCGCGGCC GAGCCGGTGG CCTTTATCGG CATAACCCTG CTGGTGGCCT ACCTGTCCCT GGTGCTGGGC GAGCTGGTGC CCAAAAGGCT GGCTATGCAG AGCGCCGTCG GCTTCACCAA GGTCCTGGCC CCGCCGCTGG TGGTCCTTTC CGAGGTCATG CGGCCCGTCA TCTGGCTGCT GTCCGTTTCC ACCGACGCCG TGGTCCGGCT CTTTGGCGGT GACCCGCACG CCAAGCGGGA GGGGATCAGC TCCGAGGAAC TCTGGGACAT GGTGGCGGAG AGCGACCTGC TGGAAGAGAG CAGCCGGCAC ATCCTGACCG ACGTGTTCGG CGCCGGGGAC CGCACACTGC AGGAGGTCAT GCGCCCCCGC ACCGAAGTGA CCTTCATTGA CGGCACCATG ACTATTGCCG ACGCACGCAG CATGGTCCGG GACGGCCCGT ATTCGCGGTT CCCTGTGATC GGCAGGACCC CGGACGACGT CCTGGGCTTC GTCCACATCC GGGACCTGAT GACCCGGACT GAACAGCAGG ACCAGGGGCT GGTGAAGGAC ATCGTCCGCG AACTCCTCCC CCTGCCGGGA ACCAACCGGG TGCTGCCGAC GCTGTCGCGG ATGCGCCGGC TGGGCCACCA CATCGCGCTG GTGGTGGACG AATACGGCGG CACCGACGGC ATCGTCACGC TGGAGGACCT GGTCGAGGAG TTGGTGGGCG AAATCTACGA CGAATACGAC ACCGGGGCCG ACCACGAGGA CCGCGTCACC GTGGCCAACG GATCCATCGA CGTGGACGGC GGCCTGATCC TGCAGGAATT CGCCGCTGCC ACCGGCATCA CCCTGCCGGA GGGCCGCTAC GAGACAGTGG CCGGGTTCGT CATCTCCCGC CTGGGCCGCC TGCCCGTGGT CGGGGACCGG GTGCAGGTGC CGGGCCAAGT GCTGACGGTG CTCGCCATGG ACAGGCTCCG CATCGCCCGG ATCCGGGTGA CGCCCGTGAC CGGGCAGCCG GCGGTCTAG
|
Protein sequence | MDSGTLVNFA LVLFFVLLGG VFAATEMALI SLRESQVRMI EKAGKRGARA AALARNPNRF LSTVQIGVTL SGFFSAAYGA STISPDIEPI LKGAGFGAAA EPVAFIGITL LVAYLSLVLG ELVPKRLAMQ SAVGFTKVLA PPLVVLSEVM RPVIWLLSVS TDAVVRLFGG DPHAKREGIS SEELWDMVAE SDLLEESSRH ILTDVFGAGD RTLQEVMRPR TEVTFIDGTM TIADARSMVR DGPYSRFPVI GRTPDDVLGF VHIRDLMTRT EQQDQGLVKD IVRELLPLPG TNRVLPTLSR MRRLGHHIAL VVDEYGGTDG IVTLEDLVEE LVGEIYDEYD TGADHEDRVT VANGSIDVDG GLILQEFAAA TGITLPEGRY ETVAGFVISR LGRLPVVGDR VQVPGQVLTV LAMDRLRIAR IRVTPVTGQP AV
|
| |
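The sequence fields above are printed in space-separated blocks of ten characters. The sketch below is one way to strip that spacing, compute the GC fraction, and translate the gene sequence so the results can be compared with the GC content and Protein sequence fields. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits, which this sketch ignores).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field above.
    prefix = clean_sequence("ATGGACAGCG GCACGTTGGT CAACTTCGCC")
    print(translate(prefix), gc_fraction(prefix))
```

Passing the full Gene sequence field through `clean_sequence` and `translate` gives a string that can be compared with the Protein sequence field after cleaning it the same way. The gene sequence is already shown in coding orientation (it starts with ATG and ends with TAG), so no reverse complement is needed despite the minus-strand annotation.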