Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0848 |
Symbol | |
ID | 4446651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 916899 |
End bp | 918875 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639688655 |
Product | hypothetical protein |
Protein accession | YP_830346 |
Protein GI | 116669413 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCACGC TCACCAGGCC GCCGGCCGAC CCCTCGGACA GGTTCACCCG CAAGCCCCAC CGACTGCGGA GCTGGCTGCT GGAGGGCATG CCCGAAGGAT CCGGCAAACG CCAGGGGCCG CACGGCCAGC CGCAGGCGAA CCACACCCCC CAGCCCTGGT GGAAAGTCAT GTGCCTCACC GGCGTCGACT ACTTCTCCAC TCTTGGCTAC CAGCCGGCCA TTGCCGCACT CGCCGCTGGA ATGGTCTCCC CGCTTGCCAC CGTGGTCCTG GTGGCGGTGA CGCTCCTCGG CGCGCTTCCG GTGTATCGCC GGGTGGCCTC GGAGAGTCCC CGCGGCGAAG GCTCCATCGC CATGCTGGAA CGGCTCCTGC CACGATGGGG CGGCAAGCTG TTCGTCCTGG CGCTCCTGGG ATTCGCCGCC ACGGACTTCA TGATCACCAT CACCCTCTCG GCAGCGGATG CCAGCGCCCA CGCCATCGAA AACCCGTTCG CCCCGGACTT CCTGCACGGC CAGGAAGTGG CCATCACGCT GGCACTGATT GCCGGCCTGG GCATAGTATT CCTCCGCGGT TTCAAGGAAG CCATCAACGT GGCCGTCATC CTGGTGGCAG TATTCCTGCT GCTTAACGCG GTAGTGGTGC TGGTGGGCAT CGCGCATGTA TTCAGCGAAG CCCACGTTGT CACCGACTGG TGGGCCGCCC TCAACCAGCA GCACGGCAAC CCGCTGGTGA TGATCGGCAT CGCGCTGCTG GTCTTCCCCA AACTCGCCCT TGGCCTGTCC GGCTTCGAAA CGGGAGTTGC CGTCATGCCC CAGATCCAGG GCAGCCCGGA CGACACTGAA GCAAAGCCCG ACGGCCGGAT CCGCGGCGCG CACAAGCTCC TGACCACTGC CGCCCTGATC ATGAGCGGGT TCCTCATTGC GTCCAGCTTC ATCACCACGT TCCTGATCCC GGCCGCTGAG TTTCAACCGG GGGGCAAGGC CAACGGCCGC GCCCTGGCCT TCCTGGCCCA CCAGTACATG GGCGACGGTT TCGGGACCGT GTACGACATC AGCACCATCG CCATCCTCTG GTTCGCCGGG GCGTCCGCGA TGGCGGGGCT CCTCAACCTG GTGCCGCGCT ACCTTCCACG CTTTGGCATG GCGCCGGCCT GGACCCGCGC CGTGCGGCCG CTGGTACTCG TGTTCACCGC CGTCGCCTTC CTCATCACGG TGGTGTTCGA GGCCAACGTC GAAGCACAGG GCGGCGCATA CGCCACCGGC GTGCTGGTGC TGATGACCTC CGCATCCATC GCCGTCACCT TGTCCGCCCG GCGCCGACAC CAACGCGGCC GGACGTTCGC CTTCGGTGCG ATCGCCGTCG TCTTCCTCTA CACAACGGTG GCGAACGTAG TGGAGCGCCC CGACGGCCTG AAGATCGCGG CGTTGTTCAT CCTCGGCATT GTGGTGGTCA GCTTCGCATC CCGCGTCCGG CGGTCGTTCG AACTTCGCGC CACCCACATC CGGCTTGACG AACGGGCCCT GGAGTTCATG GCAGCCAACG AAGAGGGGCC CATCCGGCTG ATTGCGCACG AACCCAAACA CCTCAGCGCA GCGAGGTACC GGGCCAAGCT GGAACATGCC CAGCTGGCCA ACCACCTGCC CGTGGACAGC GATGCCATTT TCATCGAGAT CCTGGTGGAC GACAGTTCAG ACTTCGAACA GGAGCTCATG GTCACCGGGA AAATCCGGCA CGGCTTCCGC ATCCTTGAGA TCCATAGCAA CAACGTGCCC AACACCCTGG CGGCCGTCCT GCTCCACCTC CGTGACGTCA CGGGCCTGAT GCCGCATATC TACTTCCGCT GGACCGAGGG CAACCCGCTG ACCAACCTGA CCCGGTTCCT GTTGTTTGGC GAGGGTGAGA TCGCCCCTGT GACCCGGGAA GTGCTGCGCG AGGCCGAGCC GGACGTCACG CGCCGGCCCT GGGTCCACGT CGGCTAG
|
Protein sequence | MTTLTRPPAD PSDRFTRKPH RLRSWLLEGM PEGSGKRQGP HGQPQANHTP QPWWKVMCLT GVDYFSTLGY QPAIAALAAG MVSPLATVVL VAVTLLGALP VYRRVASESP RGEGSIAMLE RLLPRWGGKL FVLALLGFAA TDFMITITLS AADASAHAIE NPFAPDFLHG QEVAITLALI AGLGIVFLRG FKEAINVAVI LVAVFLLLNA VVVLVGIAHV FSEAHVVTDW WAALNQQHGN PLVMIGIALL VFPKLALGLS GFETGVAVMP QIQGSPDDTE AKPDGRIRGA HKLLTTAALI MSGFLIASSF ITTFLIPAAE FQPGGKANGR ALAFLAHQYM GDGFGTVYDI STIAILWFAG ASAMAGLLNL VPRYLPRFGM APAWTRAVRP LVLVFTAVAF LITVVFEANV EAQGGAYATG VLVLMTSASI AVTLSARRRH QRGRTFAFGA IAVVFLYTTV ANVVERPDGL KIAALFILGI VVVSFASRVR RSFELRATHI RLDERALEFM AANEEGPIRL IAHEPKHLSA ARYRAKLEHA QLANHLPVDS DAIFIEILVD DSSDFEQELM VTGKIRHGFR ILEIHSNNVP NTLAAVLLHL RDVTGLMPHI YFRWTEGNPL TNLTRFLLFG EGEIAPVTRE VLREAEPDVT RRPWVHVG
|
| |