Gene Information |
Locus tag | Arth_3663 |
Symbol | |
ID | 4443664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4117975 |
End bp | 4119015 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691487 |
Product | hypothetical protein |
Protein accession | YP_833138 |
Protein GI | 116672205 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
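The length fields in the record above follow from the coordinates by simple arithmetic. The sketch below is only an illustrative consistency check, not part of the database record. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4117975
end_bp = 4119015

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1041 0 346
```

Under these assumptions the computed values agree with the listed Gene Length (1041 bp) and Protein Length (346 aa).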
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAT ACCTTCCCGG CATCATCTGG CTGGTGGTGC TCCTGGTGGT CAACGCCTTC TTCGTCGGGG CCGAGTTCGC TGTCATCTCC GCCCGCCGGT CGCAGGTCGA GCCCAAGGCC GAGGCCGGCA GCAAGGCCGC GAAGACCACG CTGTGGGCCA TGGAGCATGC CACGCTGATG CTGGCCACCA GCCAGCTGGG CATCACCGTG TGCTCGCTCG TGATCCTGAA CGTTTCCGAA CCCGCCATCC ACCACCTGCT GGAAATCCCC CTGGGCCTGA CCTCGCTCTC CGGCGAAGCG ATCGGGATCA TCGCCTTCGT GGCCGCGCTG TTGCTGGTGA CCTTCCTGCA CGTGGTCATC GGTGAAATGG TGCCCAAGAA CATCTCGTTC TCCGTTCCCA CCCGGGCCGC GCTCATCCTT GCCCCGCCGC TGGTGATGGT GTCACGCTTG TTCAAGCCGG TGATCTGGAC CCTTAACGGG ATCGCAAACT CCATCCTGCG GCTCTTCAAG GTCCAGCCCA AGGATGAGGC TACCAGCGCC TACACCCTGG ACGAGGTGGC CAACATCGTG GAGCAGTCCA CCCGGGACGG CATGCTCACG GACACCACCG GCACGCTGAA CGCAGCGTTC GAATTCACCG CCAAGACCGT GGCGGACGTG GAAGTGCCGA TCAGCGAGAT GGTGCTCCTG CCGGCCTCGT CGACGCCGGC GGACATCCAG AGCGCGGTGG CCCGGCACGG GTTCTCCCGC TACATCCTGA CGGACGACGA CGGCGTGCCC TCCGGCTATC TGCACCTCAA GGACGTCATG GACCTGACGT CCCCGGAAAA ATTCGCCAGG CCCGTGCCGG CCAAGAGAAT CCGACGGCTC GCCTCCGCGT TCAGCGGCAG CGACCTCGAG GACGCGCTGG CCACCATGCG CCGCACCGGC GCCCACGTGG CCCGGGTCTT CGACGCGGAC GGGAAGACCA CCGGCGTCCT CTTCCTGGAG GACATCATCG AAGAGCTGGT GGGCGAAGTG CAGGACGCCA CGAGCGCCTA G
|
Protein sequence | MSEYLPGIIW LVVLLVVNAF FVGAEFAVIS ARRSQVEPKA EAGSKAAKTT LWAMEHATLM LATSQLGITV CSLVILNVSE PAIHHLLEIP LGLTSLSGEA IGIIAFVAAL LLVTFLHVVI GEMVPKNISF SVPTRAALIL APPLVMVSRL FKPVIWTLNG IANSILRLFK VQPKDEATSA YTLDEVANIV EQSTRDGMLT DTTGTLNAAF EFTAKTVADV EVPISEMVLL PASSTPADIQ SAVARHGFSR YILTDDDGVP SGYLHLKDVM DLTSPEKFAR PVPAKRIRRL ASAFSGSDLE DALATMRRTG AHVARVFDAD GKTTGVLFLE DIIEELVGEV QDATSA
|
| |
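The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of that relationship, shown only on the first 30 bases of the gene sequence. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built on the assumption that table 11 uses the standard amino acid assignments for all 64 codons (it differs from the standard code in its permitted start codons).

```python
from itertools import product

# Codon order T, C, A, G at each position, paired with the
# amino acid string for the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence in the record.
gene_prefix = "ATGAGTGAAT ACCTTCCCGG CATCATCTGG"
print(translate(gene_prefix))  # MSEYLPGIIW
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would be one way to compare against the listed protein sequence and GC content.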