Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_2914 |
Symbol | |
ID | 6159997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | + |
Start bp | 3212028 |
End bp | 3214148 |
Gene Length | 2121 bp |
Protein Length | 706 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641665693 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001791944 |
Protein GI | 171059595 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.384683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCC CCGCCGCGCC ATCCGCCCCA ACCCGCCGCG CACGCCTGGG CGCCGCCTGG CGCGAGCGTG CCGCGCAGCC CGGCCGCGAA TCGCGCGACA CCCTGTTCAT GCTCGCGGTG CTGGCCTGGA CGCTGCTGCC GCAAGCCCCG CACGTGCCGC TCTGGTGCAG CCTGTTCAGC GCCGCGGCGC TGCTGTGGCG CGCCGTGCTG GCCTGGCGCG TGCGTCCGCT GCCGCCGCGC TGGCTGCTGA TCCTGCTGCT GCTGGCCTGC ATCGGCGCCA CCGTGCTGAC CTACCGCAGC CTGACCGGCA AGGAGGCCGG CCTGGCGCTG CTGGTGCTGC TGACCGCGCT CAAGACGCTG GAGCTGCGGG CCCGCCGCGA CGCGCTGGTG CTGTTTTTCC TCGGCTTCTT CCTGGTGCTG GCGAACTTTT TGCACTCGCA GTCGCTCGCC ACCGCGGTCG CGATGGTGCT GTCGGTCTGG GGGCTGCTGG CCGCGCTGGT GCTGGCGCAC ATGCCGGTCG GGCGGCCGCG GCTGGCGCAG GTGGCGGGGT TGGCGGGGCG CATGCTGGTG CTGGGCGCAC CGCTGATCGC GGCGCTGTTC ATCTTCTTCC CGCGCATGGC GCCGCTGTGG GGCGTGCCGG GCGAGGCCGG CGGGCGCACC GGCCTGTCGG ATCAGCTGCA GCTCGGCGAC GTGGCCGAGC TGGCGCTGGA CGACAGCACC GCGCTGCGGG TGCGTTTTTT CGGCCCGGCG CCGGCCGCGT CGACGCTCTA TTTCCGCGGC CCGGTGCTGA GCCTGCCCGA CGGCCAGCGC TGGGTCGCGC TGCCGTGGCG GCCGGGTGAC ACCAGCCTGG CGCTGCCGAA TCCGTCGCCG CGCGACACCG CGCTGGCCTA CGAGATGACG GTCGAGCCGC TGCGCGTGTC GACGCTGCCG CTGCTCGAAA CCAGCGTCAG CCGGCCGCAG GTCGACAGCG GCGAGGTCGA GCTCTACGCC CACCCCGACG GCCAGTGGAT CACCGGCCGG CCGATCGCCG AGCGCGTGCG GCTGCAGGCA CAGGCCGCCG TGCGCTCGCG TGCCGCCGCG TCGCCCGAGG GCCTGCTGCC GACCTATCTG CAGCTGCCGC CCGGCCAGCA CCCGCGCAGC ATCGCCTGGG CCCAGGCGCT GCGCAGCCAG CCGGCCTTCG CGCAGGCCAG CCCGAACCAG CTCGCCGATC TGCTGGCGGT GGCGCTGGTC GTGCACATCC GCCAGGGCGG CTACAGCTAT ACGCTGGCGC CGGGCACCTA TGGCGAGCAG CCGCTCGACG CCTTCTGGCT CGACCGCCGC GAGGGCTTCT GCGAGCACTT CGCCACCGCC TTCGTGGTGC TGATGCGGGC GATGGGCGTG CCCGCGCGCA TCGTCACCGG CTACCAGGGC GGCCAGTTCA ACCCGATCGA CGGCGTGCTC GAGGTGCGCC AGAGCGACGC CCACGCCTGG GCCGAATACT GGCGCGCCGG CGACGGCTGG GTGCGCGTCG ACCCCACCGC CGCGGTGGCG CCCGACCGCA TCCAGCGCAG CCAGCGGCTG GCCCCGCCGC GCGGGCTGAT GGCCGGTGCG CTGGCGCAGG TCGACCCGGC GCTGCTGGCG CGCGTGCGCG CGGTCTGGGG TGCGCTCGAC CACCAGTGGA ACCAGTGGGT GCTCAGCCAC GGCCGCCAGC GCCAGCTCGA CCTGCTGCGT GCGCTGGGCT GGCAGTCGCC CGATCTGGCG GATCTGGGCC GGCTGCTGGC GATCGGGCTG GCCGGGCTCG CCCTGCTGGG CGCGGCGTTG GGCGCCTGGC AAGCGCGCCG GGTGCGTCAG CGCGACCCCT GGCTGCGCGC CTACGCCGGC GTGCGCAAGG CGCTCATGCA GCGTGGCATC GACTGCCCGG CGCACCTGCC GCCGCGCAGC CTGGCCGCGG CGCTGCGCCG GCAACACGGC ACGGCCGCCG AGACACTGGC CGCCGCGCTG CTGGCGCTGG AGGCCTGGCG CTACCAGGCG CCGTCCGGTG CGGCGACCGA CCGGCGTACC CTTGGCGAGC TGCGCCGCCG GGCCCTCGCC GCCGCCCGCA CGATGGCGCG TCAGCCCGCT GAAAATCGAT CTTCCACCTG A
|
Protein sequence | MNSPAAPSAP TRRARLGAAW RERAAQPGRE SRDTLFMLAV LAWTLLPQAP HVPLWCSLFS AAALLWRAVL AWRVRPLPPR WLLILLLLAC IGATVLTYRS LTGKEAGLAL LVLLTALKTL ELRARRDALV LFFLGFFLVL ANFLHSQSLA TAVAMVLSVW GLLAALVLAH MPVGRPRLAQ VAGLAGRMLV LGAPLIAALF IFFPRMAPLW GVPGEAGGRT GLSDQLQLGD VAELALDDST ALRVRFFGPA PAASTLYFRG PVLSLPDGQR WVALPWRPGD TSLALPNPSP RDTALAYEMT VEPLRVSTLP LLETSVSRPQ VDSGEVELYA HPDGQWITGR PIAERVRLQA QAAVRSRAAA SPEGLLPTYL QLPPGQHPRS IAWAQALRSQ PAFAQASPNQ LADLLAVALV VHIRQGGYSY TLAPGTYGEQ PLDAFWLDRR EGFCEHFATA FVVLMRAMGV PARIVTGYQG GQFNPIDGVL EVRQSDAHAW AEYWRAGDGW VRVDPTAAVA PDRIQRSQRL APPRGLMAGA LAQVDPALLA RVRAVWGALD HQWNQWVLSH GRQRQLDLLR ALGWQSPDLA DLGRLLAIGL AGLALLGAAL GAWQARRVRQ RDPWLRAYAG VRKALMQRGI DCPAHLPPRS LAAALRRQHG TAAETLAAAL LALEAWRYQA PSGAATDRRT LGELRRRALA AARTMARQPA ENRSST
|
| |