Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2295 |
Symbol | |
ID | 4445338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2582616 |
End bp | 2584289 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639690104 |
Product | hypothetical protein |
Protein accession | YP_831775 |
Protein GI | 116670842 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00478543 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACAGACA GTCAGAAATC CGACGAAACA GTAACAGCCT CCGAAGAAGC CGAGACCGTG GCTCAGGTGG TTGACGACGC ACAGGCAGCT GGCAACGACG CAGCTCCCGA GGCTACGGCA GAGGCGCAGT CCGGTACCCA GGAAGTTTCC GCAACGGAGG CCGAGGCACC CGCCGAAGAC GCCCCCGCGG CCGAGGCACC CGCTCCGGCT CAGGCCGCCG ACGACGCCGC GCAGGCTCAG GCCGCCGACG ACGCCGCGCA GGCTCAGGCC GCCGACGACG CCGCGCAGGC TGAGGCGCCC GCGGCCCCGG CACCGCGTCC GGCGGCCCGC CCGGCGCCCT CACCGGCAGC GTTCGCCGCG CGGCCGAAGG CAGCGTCCTC GCCTGCAGTT CCCGTCCCCG CTCCTGTTTC CTCGGCTGCC TCCCTGGCCG AAGCCGCACG CTGGGGCCGC GTCGAAGGCG ACGGCCACGT CTTCCTGACC ATCGACGGCG AAGAACACCC CGTGGGTCAG TACCCGGACG TCAGCGACGA GGAAGCCCTC GGCTACTTCG CCCGCAAGTA CGATGACGTG GTGGCCCAGA TTGTCCTGCT CGAACAGCGG GTGGGCTCCA AGGCCCCCAC CACCGACATG CAGAAGACCG TGACGCACCT GCGCGAGCAG CTGGCGGAAC GCAACATGGT GGGCGACCTC CGCGCGGCCG AAGCCCGTCT CGATACGCTG TCCACGCAGA TCGCCGAACT CGAGAAGGCC GAAAAAGCCG AACACGACGC CGTGCGCGCC GCCGAGCTGG CCGCACGGGA AGCGATCGTT GCAGAAGCGG AAGAAATTTC CGGCCACGAC CCCGCGCAGA TCCAGTGGAA GACCTCCAGC GCCCGCATGA ACGAGCTCTT CGAAAGCTGG AAGGCGGCAC AGAAGAACGG CGTGCGGCTG GGCCGCAGCA ACGAGGACGC CCTCTGGAAG CGGTTCAGGG CAGCACGGAC GGTCTTCGAC CGCCACCGCC GGGCCTACTT CTCCCAGCTG GACAGCAATA ACTCCGCAGC CAAGGCCGCG AAGGAAAAGC TGATCGCTGA AGCCGAAGCA CTGTCCACCT CAACGGACTG GGGTTTCGCC GCAGGTGAAT ACCGGCGCCT GATGGACGAA TGGAAGGCCT CACCGCGGGC CAGCCGCAAG GACGACGACG CACTCTGGGC CCGCTTCCGC GCCGCCCAGG ACGTGTTCTT CACCTCACGC CAGGCAGCCA ATGACGAGAT CGACCAGGAG TACGGTGCAA ACCTGACCGT GAAGGAAGCG CTCCTGGTCG AGGCGAACGC CCTGCTGCCC ATCAAGGACC TGGGTGCCGC CAAGAAGGCC CTCCAGTCCA TCCGTGACCG CTGGGAGGAA GCCGGCAAGG TTCCCCGCGC GGACATGGGC CGGATCGAAG CCGGGCTTCG GAAGGTGGAG GACGCCGTCC GACAGGCCGA AGACGAAAAC TGGAAGCGGT CCAACCCGGA GACCAAGGCG CGGACCAACA GCGCACTCTC CCAGCTGGAA GCCGCCATCG CAGGGCTGAA GGAAGACCTC GCGAAGGCGG AGCAGGCTGG CGACCAGCGT AAGATCAAGG CCGCCCAGGA GGCCCTCGAG GCCCGCCAGG CATGGCTTGA CCAGATCTCG CGCTCGGCCA GCGAACTGGC ATAG
|
Protein sequence | MTDSQKSDET VTASEEAETV AQVVDDAQAA GNDAAPEATA EAQSGTQEVS ATEAEAPAED APAAEAPAPA QAADDAAQAQ AADDAAQAQA ADDAAQAEAP AAPAPRPAAR PAPSPAAFAA RPKAASSPAV PVPAPVSSAA SLAEAARWGR VEGDGHVFLT IDGEEHPVGQ YPDVSDEEAL GYFARKYDDV VAQIVLLEQR VGSKAPTTDM QKTVTHLREQ LAERNMVGDL RAAEARLDTL STQIAELEKA EKAEHDAVRA AELAAREAIV AEAEEISGHD PAQIQWKTSS ARMNELFESW KAAQKNGVRL GRSNEDALWK RFRAARTVFD RHRRAYFSQL DSNNSAAKAA KEKLIAEAEA LSTSTDWGFA AGEYRRLMDE WKASPRASRK DDDALWARFR AAQDVFFTSR QAANDEIDQE YGANLTVKEA LLVEANALLP IKDLGAAKKA LQSIRDRWEE AGKVPRADMG RIEAGLRKVE DAVRQAEDEN WKRSNPETKA RTNSALSQLE AAIAGLKEDL AKAEQAGDQR KIKAAQEALE ARQAWLDQIS RSASELA
|
| |