Gene Information |
Locus tag | Arth_2031 |
Symbol | |
ID | 4445440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2290732 |
End bp | 2291754 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689839 |
Product | hypothetical protein |
Protein accession | YP_831511 |
Protein GI | 116670578 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3285] Predicted eukaryotic-type DNA primase |
TIGRFAM ID | [TIGR02776] DNA ligase D; [TIGR02778] DNA polymerase LigD, polymerase domain |
|
|
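The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed 1023 bp), that the gene is unspliced as the record states, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Arth_2031).
length = gene_length_bp(2290732, 2291754)
print(length)                           # 1023
print(expected_protein_length(length))  # 340
```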
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.232126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGCG AATCCACCAC CCTCACCGTC GGGGGCCCGA ACGGCCCCCG CGACATGCGG ATCTCAAGTC CCGGCCGTGT CCTATGGCCG GATCTCGGGC TGACCAAGCT GGACCTTGCG CGCTACCTCG CGGAGGTGGG TGACGCCTTT ATCGCGGCGA ACGGCGGACG TCCCGTTGCC CTGCAGCGGT TTTCGGACAA CGTCGACGGC GAGCAGTTCT TCTCCAAAAA TCCACCCAAG GGCACGCCGG ACTTCATCCG GTCGGTGAAG GTGGTCTTTC CGAGTGCGCG TTCTCACCCG ATGCTGGTCC TGGACGAACC GGCCGCCGCC GTCTGGGCTG CCCAGATGAA CACCGTGGTG TTCCACCCCT GGCCGTCGCG TGCCGAAAAC ACGGACAACC CGGACCAGTT GCGGATCGAC CTGGACCCCC AGCCGGGAAC CGACTTCGAC GACGCCATCC CTGCAGCCCT GGAGCTGAAG GAGGTGCTCG CGGAAGCCGG ACTCGCCACC TTTATCAAGA CCTCGGGGAA CCGCGGCCTC CACGTCTATG CGCCGGTGGA GCCGGCTTTT GAGTTCCTGG ATGTCCGCCA CGCAGTCATC GCCGCCGCCC GTGAACTGGA GCGGCGGATG CCGGACAAGG TCACCACGGC CTGGTGGAAG GAAGAACGCG GCGAACGGGT GTTCGTGGAC TTCAACCAGG CAAACCGCGA CCGCACCATC GCCGGCGCCT ACAGCCCCCG TGCACTGGGC CACGCCCCGG TGTCCTGCCC GATCACCTGG GACGAACTGG GCAGTGCGGA CCCGAAGGAT TTCACCATTC TCACCGTCCC CGAACGGCTC CGGACTGTCG GGGACCCGTG GGCGGACATG AACGCCAACC CGGGAAAAAT TGACGTGCTG CTCGAGTGGT GGGAGCGCGA CGTCGGCTCC GGACTGGGGG AGCTTCCGTT CCCGCCGGAC TACCCCAAGA TGCCAGGTGA ACCTCCGCGG GTTCAGCCCA GCAGGGCCCG CAAGAAGGAC TAA
|
Protein sequence | MASESTTLTV GGPNGPRDMR ISSPGRVLWP DLGLTKLDLA RYLAEVGDAF IAANGGRPVA LQRFSDNVDG EQFFSKNPPK GTPDFIRSVK VVFPSARSHP MLVLDEPAAA VWAAQMNTVV FHPWPSRAEN TDNPDQLRID LDPQPGTDFD DAIPAALELK EVLAEAGLAT FIKTSGNRGL HVYAPVEPAF EFLDVRHAVI AAARELERRM PDKVTTAWWK EERGERVFVD FNQANRDRTI AGAYSPRALG HAPVSCPITW DELGSADPKD FTILTVPERL RTVGDPWADM NANPGKIDVL LEWWERDVGS GLGELPFPPD YPKMPGEPPR VQPSRARKKD
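The sequences above are printed in space-separated blocks of ten, so they need cleaning before any calculation. The following is a minimal sketch, not the database's own method, of how the GC content and the protein sequence could be rederived from the gene sequence. It assumes plain A/C/G/T input and uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs in its permitted start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). All function names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = "ATGGCGAGCG AATCCACCAC CCTCACCGTC"
print(translate(fragment))  # MASESTTLTV
```

Passing the full gene sequence to `translate` and `gc_fraction` should, under the same assumptions, allow a comparison with the Protein sequence and GC content fields of the record.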