Gene Information |
Locus tag | Arth_3662 |
Symbol | |
ID | 4443663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4116621 |
End bp | 4117982 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639691486 |
Product | hypothetical protein |
Protein accession | YP_833137 |
Protein GI | 116672204 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
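The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (the reported values are consistent with that reading) and that the unspliced CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4116621, 4117982)  # Start bp / End bp for Arth_3662
    print(length, protein_length(length))   # prints: 1362 453
```

Both results agree with the Gene Length (1362 bp) and Protein Length (453 aa) fields of this record.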
Plasmid Coverage Information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGAAT GGATCATGCT CGGTATCGGC CTTGTCCTCA CGGTGGGTAC CGGCTTCTTC GTAGCGTCCG AGTTCGCGCT GGTCAACCTT GACCGCAACG ACCTTGAAGC CCGCCAGGCC CGCGGCGAGA AACGCCTTGC GCCCACCATC AAGGCCCTCA AGATCACCTC GACGCATCTC TCCGGAGCCC AGCTCGGCAT TACGCTGACC ACGCTCCTCA CCGGCTACAC CTTTGAACCG GCCATCAGCA AAATGCTCAG CGGCCCGTTG CTGGCCGCAG GCCTGCCTGA GTCCGTGGTG CCGGGGATCG GCGCCGTGGC GGGGATCTTC CTGGCCACCA TTTTCTCGAT GGTGATTGGC GAACTCGTCC CGAAGAACTT CGCGCTGGCC CTTCCGTTGG CCACCGCCAA GATCGTGGTC CCCTTCCAGG CCCTGTTCAC CGCCGTCTTC AAGCCCGTCA TCCTGCTGTT CAACAACACT GCCAACGGCA TCATCAGGTC GTTTGGCATC GAGCCGAAAG AGGAGCTTTC CGGCGCGCGC AGCGCCGAGG AACTCAGCTC GCTGGTGCGG CGGTCCGCGC TGGAAGGCGT GCTCGACGTC GACCATGCCG TCTTGTTGCA CCGCACCCTC CGCTTCTCTG AGCACTCCGC TGCCGACGTC ATGACGCCCC GCGTCCGGAT GACGGCGGTG AATGCCGACG ACACGGCGGA GCAGATCGTC ACCCTGGCCA CGTCCACCGG CTACTCGCGT TTTCCGGTGA TCGGACGGGA CCGCGACGAC GTCCTGGGGG TCCTGCACGT CAAGCAGGCG TTCGCTGTTG CCTTGGAAGA ACGTGCCAAT GTGACCGCCG CGAGCCTCAT GATCGACCCG CTGCGGGTCC CCGAATCCAT GGGCGTGGAC ACGCTGCTGG TCCTGCTCCG CAGGCAGGGC CTCCAGGTGG CCATCGTGTC CGACGAGCAC GGCGGAACGG CCGGCATCGT CACCCTCGAA GACCTGGTGG AGGAGATCGT GGGCGAACTC GAGGACGAAC ACGACCGCGC ACGCGTGGGT GTGGTCCGGA TCGGCCGCGC CATCACATTC GACGCCTCGC TGCGCCCGGA CGAACTGCTG GACCGGACAG GGATTGAAGT GCCCGACGGC GAGGAGTACG ACACCATCGC GGGTTTCGTC ACCGACCAGC TGGACCGGAT CCCCGAGCTC GGCGACGAAG TCACCGTGGA TGGCGGAACG CTGCGCGTGG AACGCGTGGT GGGAACCCAC GTGGAGCGCC TCCGCTTCAC GCCGGACGAA TCACAGGAAG CACCCCAGAG CCCGCATGAC CGCATCATCG ACACCCTCAC CTCGGAGCTG ACCCATGAGT GA
|
Protein sequence | MYEWIMLGIG LVLTVGTGFF VASEFALVNL DRNDLEARQA RGEKRLAPTI KALKITSTHL SGAQLGITLT TLLTGYTFEP AISKMLSGPL LAAGLPESVV PGIGAVAGIF LATIFSMVIG ELVPKNFALA LPLATAKIVV PFQALFTAVF KPVILLFNNT ANGIIRSFGI EPKEELSGAR SAEELSSLVR RSALEGVLDV DHAVLLHRTL RFSEHSAADV MTPRVRMTAV NADDTAEQIV TLATSTGYSR FPVIGRDRDD VLGVLHVKQA FAVALEERAN VTAASLMIDP LRVPESMGVD TLLVLLRRQG LQVAIVSDEH GGTAGIVTLE DLVEEIVGEL EDEHDRARVG VVRIGRAITF DASLRPDELL DRTGIEVPDG EEYDTIAGFV TDQLDRIPEL GDEVTVDGGT LRVERVVGTH VERLRFTPDE SQEAPQSPHD RIIDTLTSEL THE
|
| |
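The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares for all non-start codons. Alternative start codons permitted by table 11 are not handled specially, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three and last two blocks of the gene sequence in this record.
    first_blocks = "ATGTATGAAT GGATCATGCT CGGTATCGGC"
    last_blocks = "ACCCATGAGT GA"
    print(translate(first_blocks))  # prints: MYEWIMLGIG
    print(translate(last_blocks))   # prints: THE
```

The two translated fragments match the first block (`MYEWIMLGIG`) and the last block (`THE`) of the protein sequence above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.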