Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3359 |
Symbol | |
ID | 4444088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3776921 |
End bp | 3778135 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691182 |
Product | hypothetical protein |
Protein accession | YP_832834 |
Protein GI | 116671901 |
COG category | [S] Function unknown |
COG ID | [COG1432] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCAGCAG CACCACCTAT GCTTCAGAGC ATGAGCCGCA AAAGTGTGAT CTTCATTGAC GCAGGTTTCC TCCTGGCCAC AGGCGGGCTC CGGGTCACCG GGAACTCGCT GCGCTCTGCC TTTTCCGTCC AGTACAAGAG CCTTGTGGAC GGCATCCAGG GGTTTGTCAG CGAGCGCGAC AGCCGGGACC TGCTGAGGAT GTACTGGTAT GACGCCGCCA AGGACGGCTT GTTCAGCGAC GAACACAAGC GGATCGGACT GCTCCCGGGC GTGAAAGTCC GGCTCGGGCG CATGTCCTAC AACGGCGAAC AAAAAGGCGT GGACCTTAAG CTCGGGCTGG ATCTGGTGGG AGTGGCCCGG AACCGGTCGG CTGACGTCGC CTACCTGCTC TCCGGGGACG ATGACCTCGC AGAGGCTGTG GAAGCGGCGC AGGACCTCGG CATGAAGGTG GTGCTGCTGG GAATTGAGAA CCAGGGCCAC CGGCTGGGGG TCACCGCGGT GGCTGAGCAT CTGGCACTGC AGGTTGACGA CATTGCCACC CTGCCGCAGA CGCTCCTGGA CCGCTGCTTT GCGAAGTCGG CGCCAGTGGC TGAGCTTGCC GCCTACGCCC CCGCTTCGTC CGAAACCGGA ACTACAGTCA GCATTTCCGG CAACGGTGCA GGCCATCTTC CCGGCAGGCG GCCTGTCCCC GGCCCGGGCG TTGTTCCCGC CCGCAGTGCC ACCGCGGCTC CGGAAGCTGG CTCCGCGCCG GGTTCCGCTG CCGAGGCTGC CGGATCCGGC GCGCCTTCAG GGCCCGCGGC TGCTTCTGCC GCGGCTGCTG CGGGCACGGC AGGTTTATCC GGCAGGCCGG TCCCGACGCC CGGGCCGCGG CGTGTGCAGC CGGACCTGCG TCCGGTGCCC GCAGCCGGCG TTGCGCGCCG GGAACCGGTC TATTCCACGG CAACTGGCGC CCCTGCCGCC CAAAGCCCCT GGTTCGACCT GGTGGAAAGT GCCGAAGCGG TGGCTGTCAG CCTCGCCGAT AACTGGCACG GCAGCGTCAG CCAGCGCGAA CTCAACGAAC TCCTCGCCGA GCGGCCGCTG CTGCCGCCGA CGATTGACCG GGTGCTCATC AAGGACTGTG CCGCGAAGAT CGGCGAAGCC AAGACGGACC TCCAGGACAT CCGCAAGGCC ATCCGGGCGG CGTTCTGGCG CCGGTTGGAT GAGCTCGTCC AGTAG
|
Protein sequence | MSAAPPMLQS MSRKSVIFID AGFLLATGGL RVTGNSLRSA FSVQYKSLVD GIQGFVSERD SRDLLRMYWY DAAKDGLFSD EHKRIGLLPG VKVRLGRMSY NGEQKGVDLK LGLDLVGVAR NRSADVAYLL SGDDDLAEAV EAAQDLGMKV VLLGIENQGH RLGVTAVAEH LALQVDDIAT LPQTLLDRCF AKSAPVAELA AYAPASSETG TTVSISGNGA GHLPGRRPVP GPGVVPARSA TAAPEAGSAP GSAAEAAGSG APSGPAAASA AAAAGTAGLS GRPVPTPGPR RVQPDLRPVP AAGVARREPV YSTATGAPAA QSPWFDLVES AEAVAVSLAD NWHGSVSQRE LNELLAERPL LPPTIDRVLI KDCAAKIGEA KTDLQDIRKA IRAAFWRRLD ELVQ
|
| |