Gene Information |
Locus tag | Lcho_4358 |
Symbol | |
ID | 6160947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | + |
Start bp | 4874421 |
End bp | 4877429 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641667135 |
Product | hypothetical protein |
Protein accession | YP_001793374 |
Protein GI | 171061025 |
COG category | |
COG ID | |
TIGRFAM ID | |
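The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of the record or of any database API.

```python
# Values copied from the record above.
START_BP = 4874421
END_BP = 4877429


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bp.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not contribute a residue.
    """
    codons, remainder = divmod(gene_len, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values match the Gene Length (3009 bp) and Protein Length (1002 aa) fields listed in the record.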
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTGG GGGCTGCTGG CCAGCGCATC GTCGCGGCTG CGCTCGACCG CTTGCAGCGG TCCGAGGTCG CAGACCTGCG GTTGACGGCC TTGGCGCTCG ACACCGCGCC GCCACCCGCG CCCACCGCCG GGGAAACCGG CGCGTGGCCG CTGCAGCAGC TGCACCTGAG CGCCAGCGCG GCCGAGCTGC TGGCCCTGCT GCGCACGCGC GCCGACCTGC GCGCCGCCTG GGAGGTGCCG GGCTCGGCGC AGCCCTGGAC GCAGGCGCTG CAGTCGGTGC TGGCGGGCGA CGCGGGCGCC GACACCTTCG GCCGTGCGCT GGCGCGCAGC CTGCTCGCGG CCCGCCACAG CCAGGTCCTG ACCGCCCTGC TGCAGGCCCT GGCGGATGCC GGCGAGGCCG GGGTCGAGGC CTGCCACGTC GTCGTCCAGC TCGGTGACAT CGGCGCCGCC GGGCTGCTGG TCGACGTGCT GGGCCTGCTG CGGCTGCACC TGCCACCGGG CACGCCGATC CACCTGCATG CGCAACTGCC CGAGCCCGAC ACGGCCGGCG AGCGGCCGCT GCCGGCCGCC GAGGCGCTGG CCTGGGTGCA GCTGCAGGAG CTCGACGCCC TGGCGCGTGG CCAGGGCTGG CCCGAGCGGC TGGCCGGCGG CGAAGCCGCG GCGGCGCTGG CCGACGTGCC GGCCTTCGAG CACGGCTGGC TGTCGGGCGC GATCGACGAA GACGGCCTGC AGCAGCTCGA CGCCACCCGG CGCGCCGCCC ACGCCGCCGG CCTGCTGCAG CTGCTGATCG AACAGCCCGG CCTGGTCGAC AGCTTGCGCG CCAGCCCGCC GGGCAGCAGC CGTTTCCAGG CCTGGGGCCA GGCGCAGCTG CGCTGCAACG CCCAGGCCAT CCGCGCCCAC CTGACGCAGG AGCTGCTGCT GCGCCTGCTC AGCCAGCTGC GCCATGACCA CTGGCGGCCC GCGCTCGGCT ACGTCAGCAG CCCGTCGTCG GCCGAGGCGG TGCAGCTGCT CGGCGACTCG CTGCTCGAGG CCTGGGGTGT CTCGCCGGCG CATCTGGGCC TGCAGCTCGG CCTGCCCGAA GTCGGCGAGG CCGACCAGCC GCCCGAGGCC GTCGAGCAGG AATGGCAGGC GCTGATGCAG CACGCCCTCG GCCTGCTCGA CATGGTGCCG CCGGCCGGCC AGGCCGAGGA GCTGCGCCGG CTGATGCAGG AGGCGCATGA CGAACGCTTC CGCGCGGTCG GCGTGCGGGC TGCCTTCGAT CAGCCCGAGC ACCAGCTGCG CAAACGCGCC GTGGCGGTGC GCCAGCGCAT CGAGACCGCC TTGTGGACCG ACTGGCGCGA GGGCCGCCGA TCGCTCAACG GCTGCGGCCA GCTGCTCGCC GCCGCCGTCA GCCACGTGGC ACGGCACGCC GAGGTCATCG ACCGCCAGCG GCTCGAACGC GAAGCCCAGG CGCTGCACCT GACGCAGCTG GCCGAGGCGC GCCTGGCCGA ATCGGCCGGC AGCCGGCGTG CTCGCTGGTC GCCGTTCGGT GCCCGGAGCG AAGACCTGGG CCCGCTGGCC CAGCTGCTGC GCGACGCCGG GCTGGCCCGC ACCCGGGCCA CGCAGGCGGC GCTGGCGATG CGCTACGGCG CGCTGCTGCT CGATCAGCTG ACGGTGCTGC AGGGCATGGT CGACGCCGCC GAGCTGTCGT TGGGCGCCAT GACGCAGACC GCCGACCACG CCGCCGCCGC CGCGTTGCCG CCGAGCGAGC CGGCCGACGG CGCCGCTGAC GCGCTGATCG GCGAGCGCCT CGAATCGCGC 
CAGCTGATGG CCGACGCCCG CAGCCGGCTG GTGACCGGCG AGGCGGTGCA GCGCGCCCAC GTGGCCGCGC TGCGCTCGGC CGCATTCAAG CGCTGGGGCG ACCGGCCGAG CTTCCGGGCG CTGGCCCAGT GGCTCGACGA GGACGCCAGC CCGGCGGCGC TGCTCGGCCT GTGCGAAGCG CGCCTGCCGG TGTCGGCGGT GCAGTACGTG GTCGACGACG CCTGGGCGGG CCTGGGCCTG GCCTGGCTGG CCGACGCCGA CGGCCGCGAC CGCAACCTGG CCGCCTTGCA GCGCCGCGCG GTCATCGGCC TGGCGCCTGC CGCCGACACC GCCGAGGCGT TGCCGGCAGC CCGCACCGGC TGGTTCGTGC CCAATCGCCT GGCCGAATGC CTGGGCCACG CCCCGGAGCC GGAGCCCGCC GAGGCCCTGG TCGACCGGCC CGCCGAAGAA CGCGTCGAGG CAGGCGCCGA AGCGACGGCT GACGCGGCCT CGGACGCCGC TGCCGGTGCG CCGACCCCAT CTGTCGGGCC CGACGCTGCC CTGCCGCCGG CCCCGCCCGT GCCGCTGGCG CAGCGCCTGC GCGAGATCTG GCCCGCCAGC GCCGACGTGC TGCTCGTCAG CGCCTACCCG GCACTGGCGG TGGTGCGGGT CCAGGCGGCG GCCACGCCGC TGTCGTGGCG GCGCGTGCAG GAGCTGGCGA CCTTGCACCA GGCCTGGCGG CGGCGCCACG GCAGCGCCGC GCGCGGCCTG CAGCTCGACG CCTGGCGCAG CCGGTCGGGC CCGGCCGACC GCGTGGCGCA TGACCCCGGC GAGGTGCAGG CGGTGCTGCT GCTGGCCGAT GCGCTGGGCC TGCTTGAAGA AACCCCGGCC GCCACGCTGG TCCAGCCCGA GGGCCTGCGC CTGACCCATG TCCGGCGCGA TGCCGACGGC TTCGAGTTCG AGCGCATGGC GCTCGGCAGC GGCCTGGTCG ACGCCGCCAA CCGGCATGGC GCCGAGGTGC TCGGCACCCT GTACGACGAC GTGCTCGACC GGATCCTGGC CAGCGCCGCC GAAGCCGGCA CCGCCGAGCG CATCCGGCAG GCCATGCAGA GCCGCATCGA CGGGCTCGCC GGCGGCTGGC CGCCCGAACA GCGCGACGAA CGATCGCGGG CCTGGCACGT GGCCGCCCGC ACCGCCATGA AGATCCTCCG ACAGGACACC CACACATGA
|
Protein sequence | MGLGAAGQRI VAAALDRLQR SEVADLRLTA LALDTAPPPA PTAGETGAWP LQQLHLSASA AELLALLRTR ADLRAAWEVP GSAQPWTQAL QSVLAGDAGA DTFGRALARS LLAARHSQVL TALLQALADA GEAGVEACHV VVQLGDIGAA GLLVDVLGLL RLHLPPGTPI HLHAQLPEPD TAGERPLPAA EALAWVQLQE LDALARGQGW PERLAGGEAA AALADVPAFE HGWLSGAIDE DGLQQLDATR RAAHAAGLLQ LLIEQPGLVD SLRASPPGSS RFQAWGQAQL RCNAQAIRAH LTQELLLRLL SQLRHDHWRP ALGYVSSPSS AEAVQLLGDS LLEAWGVSPA HLGLQLGLPE VGEADQPPEA VEQEWQALMQ HALGLLDMVP PAGQAEELRR LMQEAHDERF RAVGVRAAFD QPEHQLRKRA VAVRQRIETA LWTDWREGRR SLNGCGQLLA AAVSHVARHA EVIDRQRLER EAQALHLTQL AEARLAESAG SRRARWSPFG ARSEDLGPLA QLLRDAGLAR TRATQAALAM RYGALLLDQL TVLQGMVDAA ELSLGAMTQT ADHAAAAALP PSEPADGAAD ALIGERLESR QLMADARSRL VTGEAVQRAH VAALRSAAFK RWGDRPSFRA LAQWLDEDAS PAALLGLCEA RLPVSAVQYV VDDAWAGLGL AWLADADGRD RNLAALQRRA VIGLAPAADT AEALPAARTG WFVPNRLAEC LGHAPEPEPA EALVDRPAEE RVEAGAEATA DAASDAAAGA PTPSVGPDAA LPPAPPVPLA QRLREIWPAS ADVLLVSAYP ALAVVRVQAA ATPLSWRRVQ ELATLHQAWR RRHGSAARGL QLDAWRSRSG PADRVAHDPG EVQAVLLLAD ALGLLEETPA ATLVQPEGLR LTHVRRDADG FEFERMALGS GLVDAANRHG AEVLGTLYDD VLDRILASAA EAGTAERIRQ AMQSRIDGLA GGWPPEQRDE RSRAWHVAAR TAMKILRQDT HT
|