Gene Information |
Locus tag | Plut_0102 |
Symbol | |
ID | 3744873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | - |
Start bp | 112974 |
End bp | 114662 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637768135 |
Product | hypothetical protein |
Protein accession | YP_374035 |
Protein GI | 78185992 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
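The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 112974
end_bp = 114662

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1689 0 562
```

The result agrees with the Gene Length (1689 bp) and Protein Length (562 aa) fields, and the zero remainder confirms the coding sequence is a whole number of codons.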
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.037129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCATC GTCATCTCAA GCGTGAGCGG TGCAATAGGT ACCGCCCCGC ACTCCTTTTC ACCACATTCA GCATCGTACA GCTTGCGCTC CACCTGCTGG CGCTCGTAGA CTCACCTGAT CTGCGCTTGC TTATGCCACA TCTGGTGGCA TGGACTCATG ACCTACTGCT CCTTTCGATT GTGCTCATCT GCTGTAGGCT TGCCTTATCC GGGCTGCCGT CCAAACTGCG TTCACGTACC GAACTCGTGA CATTACCGAT CATAGCCATC ACCATTTTGC CGCTATCCCT CTACCCCCAG ATGCTTCGCG AGTATCTCTC CTTTCCCATA AACATTTTTT CAGCCACACC TGCCTCGACG TCTGCACTGC TGACCAGTTA CCTGGGCCTG TCACGGCTCT TTCCGCTGAT AGCGGCATCG GCATCCTTAC TTCTGGCAAT GCTGATGCCT CCGCTCCCAA TATGGCATGA GCGCTTGAAA CGCCCGCTAT CAGCCCTCTG GACCATCATC ATAGCAACCG GACTGCTGAC GCTATCCCGC TCACCTCATC CAGTGGTCAA CAGCCTGAAA GAAGAACTTT CGAGCGCACT CTGGCATGAA CGTCGCGAAG TGCCGGAACT GCATCCGGCG ACACAATGGC CGGATTCCGC CCCACTGTCC AGTAGCAGTG TGAGCTCGCT GAACGGAACG CTGAGGGCCG GCCATGTCTA CCTGATTGTT CTGGAAGGAG TAAGTTCCGA TCAGTTCGAG AAAACCCTGT CAGCAAAGAG ATCGGTTTTT TTGGGCCGCA TCGCGAAGCA TGCAAGATAT TTTGACCGAT ATTACACAAC CAATCTTGAT TCCTATACCA GCCTGATTGC AATGCTCACT TCAGAGCAGG TTCCTTACCG TGCCTATACA GATACCGGTT TATACGAAAA CGTCAATAGA GCGCCCAATC TGGTGCGTAG CTTTAACGCC GCCGGATTCC ATACCCTGTT CATCAGCACC TATGACGACC AGCCCTTCAT TCCGGTTCGC CACGAATGGT CGAGCATCAT GCACAGGCAA GATCTTCCTG CCTGGAAAAC GTGGGTATCG GTAGAATCGA GCCGTATGGA ATCAGCCACT GAAGACCGTG CGGCGCTGTC AACTATTGCA GCACTCCCCC GGCAGCACGA CAGAACGTTC ATACTCCACG AGCTGGCCTA CGGGCATACG ACGGAGTGGC GAGCAAAGAC GGGAATTCCC CAACTCGCAT ATTATGACAC CTACCTGAAT GAACTGCTCG AACTCCTCAT GGGGCAGGCC ACATGGCAGA AAAGCCTCCT TGTGATTGTC TCGGATCATG GAGACCGAGC CAGCGCGGAA AGCACCGGAA GTTACCGGGT ACCGCTTCTG ATCGTCGGGC CGGGAGTCGA ACCAGGACGG GACCATGCGC TCCGCTCACA TCTGGACCTG CAGCATATCA TCGCATCCGT CATGACAAGC AGGAACATAC CGCAGCCAAG AGAGGACGCG ATTCTGGTTG GATCGACCGG GCGCTGGATC TATGGATTGA TAGATGCCGG CGGCAATCAT CTCATGGTCG ATGACCATTC GGGCAAGGTG CTCGCATCCC GTGGCGGGAT CAACCCATTG GTTGTACAGG ATAGATTCCA GAAAATCATC AACCATTTCA GCAGACGATT CGACCCTAAA CGGCAGTAG
|
Protein sequence | MHHRHLKRER CNRYRPALLF TTFSIVQLAL HLLALVDSPD LRLLMPHLVA WTHDLLLLSI VLICCRLALS GLPSKLRSRT ELVTLPIIAI TILPLSLYPQ MLREYLSFPI NIFSATPAST SALLTSYLGL SRLFPLIAAS ASLLLAMLMP PLPIWHERLK RPLSALWTII IATGLLTLSR SPHPVVNSLK EELSSALWHE RREVPELHPA TQWPDSAPLS SSSVSSLNGT LRAGHVYLIV LEGVSSDQFE KTLSAKRSVF LGRIAKHARY FDRYYTTNLD SYTSLIAMLT SEQVPYRAYT DTGLYENVNR APNLVRSFNA AGFHTLFIST YDDQPFIPVR HEWSSIMHRQ DLPAWKTWVS VESSRMESAT EDRAALSTIA ALPRQHDRTF ILHELAYGHT TEWRAKTGIP QLAYYDTYLN ELLELLMGQA TWQKSLLVIV SDHGDRASAE STGSYRVPLL IVGPGVEPGR DHALRSHLDL QHIIASVMTS RNIPQPREDA ILVGSTGRWI YGLIDAGGNH LMVDDHSGKV LASRGGINPL VVQDRFQKII NHFSRRFDPK RQ
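The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced this record: it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper name `translate` and the constant names are hypothetical. Because the gene sequence shown already begins with ATG and ends with TAG, it is assumed to be given in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the last base varying fastest,
# paired with their amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
n_terminus = translate("ATGCATCATC GTCATCTCAA GCGTGAGCGG")
c_terminus = translate("CGGCAGTAG")
print(n_terminus, c_terminus)  # prints: MHHRHLKRER RQ
```

The two fragments match the first ten residues (MHHRHLKRER) and the last two residues (RQ) of the protein sequence listed above. Passing the full gene sequence to the same function would be the way to check the whole 562-residue protein.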
|
| |