Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1668 |
Symbol | |
ID | 4075771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1767772 |
End bp | 1768983 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006981 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_613663 |
Protein GI | 99081509 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAG AAAACACCAG GTCGGTGGCG GAACTCGCTG CCGAAATCAA AGCCGACCAC GCCAAGAGCG TTGATGCCGT CAAGGCCATT GCCGAGGAAG CCCTGGGCAA AGCTCAGAGC GGCGAAAAGT TGGCGAACGA CCTGAAGGAA AAGGCCGACG AGGCCCTGAC CGAAATGAAC GGCTTCAAAG CGTCTCTGGA CGCGCTGGAG CAAAAGCTGG CGCGCGGCGC GGGCGGCGAA GGTAGCGACG GCGAAAAGTC GCTGGGACAG CGCTTCGTTG AGAGCGAGGG CTTCAAGTCG TTCAAGGACG GCGGCTTTGA CCGTCACAGC AAGGCGAAGC TGGAAACCAA GGCGACCCTG ACCCTGGCGA CCACTGACAC AGATGGCGCC GTTGGCGATG GTGTAGCCCC GACCCGCCTG CCGGGCATCC AGGGCTTGCC GCAGCGCCGC CTGACCATCC GCGATCTGCT GGCGCAGGGC CGCATGGATG GCAACACCAT CGAGTACGTG CAGGAGACCG GTTTCAACAA CAACGCGGCT CCGGTGGCCG AAGGCGCTGC AAAGCCGTCC TCAGATATCA AGCTGGACGT GAAAACCACC ACTGCCAAGG TGATCGCGCA CTGGATGAAG GCATCGCGCC AGGCGCTGGA TGATGTTTCC GCCCTGCGCT CGATGATCGA CCAGCGCCTG CTGTTCGGCC TGGCGCTGGC GGAAGAAAAC CAGCTTTTGA ACGGTGACGG CACCGGCCAG AACCTGTCCG GCCTGATCAC CAACGCCACA GCCTATTCGG CGGCGTTTGC GCCGGCATCC GAGACCGCAA TCGACAAGAT GCGCCTCGCC ATGCTGCAAG CGGCTCTGGC TGAATACCCG GCAACGGGAC ACGTGATGCA CCCGACCGAC TGGGCACGGA TCGAGCTGAC CAAGGACGGC AACGCCAACT ACATCATCGG CAAGCCGCAA GGCACCATCG CGCCGACCCT CTGGGGCCTG CCGGTTGTGG CTACACAGGC GATCACCGTG GACAAGTTCC TGACCGGTGC GTTCAACATG GGTGCTCAGA TCTTCGACCG CTGGGATGCG ACGGTCGAAA CCGGCTACGA GAATGACGAC TTCACCAAGA ACCTCGTCAC CATCCTGGCC GAGGAGCGTC TGGCGCTGGC GATATTCCGC CCCGAAGCGT TTATCTACGG CGATCTGGGC TACGTGGCCT AA
|
Protein sequence | MAEENTRSVA ELAAEIKADH AKSVDAVKAI AEEALGKAQS GEKLANDLKE KADEALTEMN GFKASLDALE QKLARGAGGE GSDGEKSLGQ RFVESEGFKS FKDGGFDRHS KAKLETKATL TLATTDTDGA VGDGVAPTRL PGIQGLPQRR LTIRDLLAQG RMDGNTIEYV QETGFNNNAA PVAEGAAKPS SDIKLDVKTT TAKVIAHWMK ASRQALDDVS ALRSMIDQRL LFGLALAEEN QLLNGDGTGQ NLSGLITNAT AYSAAFAPAS ETAIDKMRLA MLQAALAEYP ATGHVMHPTD WARIELTKDG NANYIIGKPQ GTIAPTLWGL PVVATQAITV DKFLTGAFNM GAQIFDRWDA TVETGYENDD FTKNLVTILA EERLALAIFR PEAFIYGDLG YVA
|
| |