Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_3047 |
Symbol | |
ID | 6161574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | + |
Start bp | 3365547 |
End bp | 3368396 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641665822 |
Product | phage tail protein |
Protein accession | YP_001792072 |
Protein GI | 171059723 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02242] phage tail protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000000382276 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACGCCA ACGGCGCGCG TTTCTGGCTG CTCGGCGAGG CGCGGCACTT TCCGAGCCGC AGCGCGCTGC ACTGGGACGA CAGCTGCCGT GTGCTGCGGC TGGCCTCGCA GCGCACGCTC GTCAGCAGCC TGACGCCGGC GGCGGCCTTC ACGCTGGCGC AGAGCGCGCT CGAGCGCGTG CCGCGTGCAG TCGACGCGCT CGAAGGCGTG GCGCGCTGGG ACGCGGCGCG CTCGGCGGTG GTCGCCACCA GCCACCTGCC GGGTGACGCG GTGCTGCTGG CGCTCGGCGA GACGCCGACC GACCTGTGTG CCGCGCCCGA CGGCGTGCTC TACATCGCCC TGCCCGGCCG CGTGCTGATG CACGACCTGC GCGGGCGCTG GGCCGATGCG TCGGTCAGCC TCGCAGGCTT CGAACCCTGG CGGCTGGCGC CGGCCGCAGT CGGTGGCGCC TGGGTGCTGG AGCGCAACAC CGGCCGCGTC GCGCGCCTGA GCGGCCGGCC ACTGCGCACG CAGACGCCAC AGCCCGACGA CTACGACGGC CGCACCTTCC GCCCCACGCC CGAAAACCGC TGCCCGCCAC AACTGCGCCT GCTGCCCACG CCCACGCTGG CGCCGATCGA ACGGGTGCTG GCGCTGGCCG GTGGCGCCGA CGGCGTGCTG GCGCTGCTGT CGTGGCGCGA TGGTGAAGGC ACCGCGTGCG TGCGCTGCTG GCGCGAAACG GTCCCGGACG GCAGCATGAC CGGCATCCCG CCGCGCTGGA GCGAGCCGCT GCAGCTGACC GGCGCGAGCT ACGCCTACGC GATCACTTGG CTGGAGGCCG GCCGGCTGGC GGTGCGCGTG CCGGGCCGGC GCGATGCACC AGCGTTCGAT CTGAGCGACG AGCCGACCGG CAGCGTGGCG CCGCTCGGCG ACATCTACCC GCTGGCCGGC GAGAACAACG GCCCGCCGCT CGAAGCGCCG TTCGCCAACG GCGTCGCGCT GCCGCCGCAT TACCCGCGCG GCGAAACCGG CACGGCGCCG CTACACGCGC TGTCGCTGAA CCAGTTGGCG CGCCGCGGCG AAGCGGGCAA CGCAGCCGAC CCCACAGCCC CCGACACGCG CCACTGGCTG ATCGACAGCG GCGACAACAC CACCGTCTGG CACCGCCTGT TCGCCGAGGC AGAAATCCCG CCGCACACCG GTTTCGTGGT CTGGCTGGCG GCCGGCAACC AGGCCCAGCC GCCGGACTGG AACGCCGCGC AGGTCGACCG CGTCAGCTGG CAGCCGCATG GTTTCGGCGC CGACATTCAC CAGCTCGACC CGGCGATGCG CGCCCCGCAA CTGCCGCACG CGGTGTGGGA AAGCGCCGCC AGCGAACTCG CCGGCCACCC CGGCCTGCTG GGCGGCGTTC GCGAGCCAGG CCGGCGCGGG CTGTTCTCGG TGCTGGTGCA GGACAGCCGC CAGCGCGTGC GCCAGCTCGT CGGCCGCTAC CTGTGGGTGC GCGTGGTGCT GCATGGCGAC GGCCGCGCCA GCCCCGAGCT GGCCGCGCTG CGCGCCTGGG GCAGCCGCTT TTCGTACGCC GAGCAGTACC TGCCGCGGCT CTATCGCGAG CAGCTTTACG GCGACGCGGC CGCCGCGCCG GGCCGCCGGC TGGGGCAGAT CGCCGCCAGC TTCGCCGCCG AGCTGGATGC GGCCGGCGCG GCCGATCCGC TCGACCTCGG CGCACCGCTG CTGACCGCGC TCGCCCTCGT CGACATCGAA CCCGGCGCGC TGGCGCGGCT CACGGTCGAG CGCGCCGGCG CCGCTTGGCT GCTGCAGCAG GATCGCCAAG CGTGGCGCCT GCGCGTCGAG GCCGATCCCT TGCGCCCCGA TCAGCTCGAC TCGGTGGCCA TCGGCATCTA CCAGCCGCAG GCCACGCCGG CCGATTTCAA CGCCCGCATG CTGGCCAGTT TCGAAGGCGT GCTGACGCAG CTCGAAGACC GCGTGGCCGC AGCGCATCTG CTGAGCGACC CGGAGGTCGT GCCCGAGGCC AGCCTCGACT GGCTCGGCAG CTGGATCGGC GTGGCCTTCG ACCCCGCGCT GCCGGCGGCG CGTCGGCGCG AGTGGCTGCG CGCTGCGCCC GAGCTGGCGC GCTGGCACGG CAGCCTGCGC GGCCTGCGCG GTGCGCTCGA CGTCGCCACC GGCGGCGCGG TGCGGCAAGG TGGGGTGGTC GTGATCGAGG ATTTCCGGCT GCGCCGCATC CTCGCCACGC TGCTGGGGGT CGACCTGGCC GATGAAGACG ACCCGCTGCT GCCGGGCCTC GCACGCAGCG GTAACTCGAT CGTGGGCGAT ACGCTGGTGA TCGGCGAGAA CGAGAGCGTC GAGCTGCTGG CACTGTTCAA CGAGGCCGTG GCCAGTGCGC GCGAAGACGC CGCGGTGGTC GCTTTTCAAG AACGGCTGGC GCACCGCACC ACCGTGCTCG TCCATCAGGA AGTCGAGCCG CAGGACTTGG CGCTGATCCG CCGCATCGTC CGGCTCGAAG CGCCGGCGCA CGTCGAGGTG CGGGTGGCCA GCGCCACCTG GCCGCTGCTC GTGGGCATTG CCAGCCTGGT GGGCGTGGAC ACCTACCTCG CGCCACCCCG CACGCCGCGG CCCGTGCAGG TTCAGCGCTC GGTGCTGGGC CTGGGCGACT ACCTGATCGG CGCGCCGCTG CTCGACCCAC GCCTGAGCGG CATCGCGCAG CCGGTGCCGC CGCCCGAGGC ACACGCCGGG CCGGACCGCA CCGTCGGCTC GGCCGCCAGT TTCGTGCTCG ACGCCAGCGG CTCGACCGCC GCGCCGGGCC ACCAGATCAC CGAATACCGC TGGCGCTGGC TGGCGCCAGA TCTCACCTAA
|
Protein sequence | MDANGARFWL LGEARHFPSR SALHWDDSCR VLRLASQRTL VSSLTPAAAF TLAQSALERV PRAVDALEGV ARWDAARSAV VATSHLPGDA VLLALGETPT DLCAAPDGVL YIALPGRVLM HDLRGRWADA SVSLAGFEPW RLAPAAVGGA WVLERNTGRV ARLSGRPLRT QTPQPDDYDG RTFRPTPENR CPPQLRLLPT PTLAPIERVL ALAGGADGVL ALLSWRDGEG TACVRCWRET VPDGSMTGIP PRWSEPLQLT GASYAYAITW LEAGRLAVRV PGRRDAPAFD LSDEPTGSVA PLGDIYPLAG ENNGPPLEAP FANGVALPPH YPRGETGTAP LHALSLNQLA RRGEAGNAAD PTAPDTRHWL IDSGDNTTVW HRLFAEAEIP PHTGFVVWLA AGNQAQPPDW NAAQVDRVSW QPHGFGADIH QLDPAMRAPQ LPHAVWESAA SELAGHPGLL GGVREPGRRG LFSVLVQDSR QRVRQLVGRY LWVRVVLHGD GRASPELAAL RAWGSRFSYA EQYLPRLYRE QLYGDAAAAP GRRLGQIAAS FAAELDAAGA ADPLDLGAPL LTALALVDIE PGALARLTVE RAGAAWLLQQ DRQAWRLRVE ADPLRPDQLD SVAIGIYQPQ ATPADFNARM LASFEGVLTQ LEDRVAAAHL LSDPEVVPEA SLDWLGSWIG VAFDPALPAA RRREWLRAAP ELARWHGSLR GLRGALDVAT GGAVRQGGVV VIEDFRLRRI LATLLGVDLA DEDDPLLPGL ARSGNSIVGD TLVIGENESV ELLALFNEAV ASAREDAAVV AFQERLAHRT TVLVHQEVEP QDLALIRRIV RLEAPAHVEV RVASATWPLL VGIASLVGVD TYLAPPRTPR PVQVQRSVLG LGDYLIGAPL LDPRLSGIAQ PVPPPEAHAG PDRTVGSAAS FVLDASGSTA APGHQITEYR WRWLAPDLT
|
| |