Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3475 |
Symbol | |
ID | 4443785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3908562 |
End bp | 3909545 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639691299 |
Product | putative capsule biosynthesis protein |
Protein accession | YP_832950 |
Protein GI | 116672017 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTGA TATTCCTGGG TGACGTGATG CTGGGCCGCC TGGTGAACCG GCAGCTCAGG ACCGTCCCGC CTGCCTTCCC CTGGGGCGAT ACCCTCCCCG TCCTGGCACA GGGGGACATC CGGTTCGCCA ACCTCGAATG CGTCCTGGCC GACGGCGGGA CGCCGCAACC GGGGAAAGTG TTCCATTTCC GCTCGGACGC CCGCAACGCG GCCGTCCTGG CCGCGGCGGG CATCGACGCC GTATCGCTTG CCAACAACCA TGTGCTCGAC TACGGGGCGG ACGCCTTCCG GGAGACGCTG CCGGCGCTGG AACGCAGCGG AATCCTGCAC GCCGGCGCGG GGCCGGACCT GGAGGCCGCG CAAAGGCCGG CGATCAGGCG GGCGGGTCCG GCCGCCGTCG GGCTCATCGC CTTCACGGAC AACCAGCCGG ACTGGGAGGC GGGTCCGGAC CGCCCCGGGG TGTATTACGT GCCGGTGGCC GGGCGGCAGC AGGACGACAG AAGGGTCCAT GACCTGCTGG CGCTGGTCCG CCGCACCAAG GCCAGGGCCG ATCTGCTGGT GGTTTCGGCG CACTGGGGCG GCAACTGGGG AGCCGATGTG CCCTCCGGCC ACCGGGACCT GGGACGGGCG CTGGTGGATG CCGGGGCCGA CGTCGTGTTC GGACATTCGG CGCATATCTT CCGCGGCGTG GAACTCTACC GCGGCCGGCC CCTCATCTAC AGCGCCGGCG ACTTCATTGA CGACTATGCC GTGGACCCCG GCGAGCGCAA CGACCAGTCG TTCATCTTCT GCATGGAAAC CGCAGGGATC TCGCCTGCCC GGCTGCAGCT CCATCCAACG GTCATCGCCG GGTTCCAGGC GCAACTGGCC GGTAGGGGCG CACGCCACAT CGCCATGCGG ATGAAGGGAC TCTGCGCGCA GCTGGGAACG CAGAGCCGAT GGATCGACGA ATCGAACGTG CTGGAAATAC CGGTGGACAG TTAG
|
Protein sequence | MRLIFLGDVM LGRLVNRQLR TVPPAFPWGD TLPVLAQGDI RFANLECVLA DGGTPQPGKV FHFRSDARNA AVLAAAGIDA VSLANNHVLD YGADAFRETL PALERSGILH AGAGPDLEAA QRPAIRRAGP AAVGLIAFTD NQPDWEAGPD RPGVYYVPVA GRQQDDRRVH DLLALVRRTK ARADLLVVSA HWGGNWGADV PSGHRDLGRA LVDAGADVVF GHSAHIFRGV ELYRGRPLIY SAGDFIDDYA VDPGERNDQS FIFCMETAGI SPARLQLHPT VIAGFQAQLA GRGARHIAMR MKGLCAQLGT QSRWIDESNV LEIPVDS
|
| |