Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1193 |
Symbol | |
ID | 4570061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1349493 |
End bp | 1350695 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 639765787 |
Product | internalin-related protein |
Protein accession | YP_911653 |
Protein GI | 119357009 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0638484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAGCAGA ACACCTATAA CAAATGCCCG GTATGCAGTT TTCCGCTCTC CCTTGAAAGC GCTGTATGCC CGAGGTGCGG TAATGACATT CTTGAAGATA TCTCTTCTCT TGATCAGCAA AGCGAGGAGC TTCATCGAAA AACAATGGAT GAAAAAAAAG CCGAATGGTA CACCTGGTGT ATAACTGAAA ATCTCAATAT TTCCGATAAC GAACTATCGA CAAAACCTCC TGATAGAGAG AAGACATCAG AGTCCAGACA TCTCTTCTCT ACTCCTGATG AACAGGAATT GCTCCGTACC GCATCAAAAG CGATCCTTCT GAAGGATCAT TCGTTACGAA AAAAATGGTG GCAAGCGCTC AGTGCCGACT GGAAAGAGGT TATTAAAAAC ACCATAAAAA TAGTTCGCGA ACCCAATGAT CAGGAAATCC TTGATTTTTT TCAAACCACT CATTTCCGAT GTGACAACAG AAGGATTCAC GACCTCTGGC CAATACGAAT ACTGGAAAAT CTTGTACAAC TTCGTTGCGA TGAATCGCCG GTTGAAAGTC TTGAACCCCT CGCGCATCTC AGCTCGCTGC AACGAATCTA TGCCTTTGAT TGTGATTTCT CCTCGCTTGA GCCCCTTCGC AAACTGAAAC ATCTGAAATT GCTCTGGATA TCGAGCACCC AGATAAGTAA CCTTGACCCT CTCCGTGAAC TTACCAGTCT TGAGGAGCTG TACTGTTCGG AAACCCTTGT AAAAGATCTG GATGCACTGG CGGAACTGGT CAATCTTGAA AAGCTCAGTT GTTATAAAAC CGAAATCGCA TCTATTGAGC CTCTGGCAAA TCTTTCGAAC CTTATTGAGT TGGGCATCAA CAACTCGAAT GTCACCGATA TCCGACCGCT TTCAAAACTT ACCGGTATTG AATACCTGCG GTGCAATAAA ACAGGAATCA TGAATCTTGA ACCACTGGCA AACCTTGCAG GACTCAGAGA ACTGAGTATT TCAAGAACAC GGGTGGAGAG CCTTGAACCT CTTGCGGAAC TTATGGAGCT TGAAGAACTT GATTTTTCAA ATACGGAAGT ACAATCCATT CTCCCCCTCA TGCAACTCGA AAAACTTGAA AAAATCGAGC TCTCTGCAGG AACGGTTCCT GAAAAAGAAC TGGAAAGATT TATTGAATTG CATCCTGATT GCGAAATTCT TCTGACGCAA TAA
|
Protein sequence | MEQNTYNKCP VCSFPLSLES AVCPRCGNDI LEDISSLDQQ SEELHRKTMD EKKAEWYTWC ITENLNISDN ELSTKPPDRE KTSESRHLFS TPDEQELLRT ASKAILLKDH SLRKKWWQAL SADWKEVIKN TIKIVREPND QEILDFFQTT HFRCDNRRIH DLWPIRILEN LVQLRCDESP VESLEPLAHL SSLQRIYAFD CDFSSLEPLR KLKHLKLLWI SSTQISNLDP LRELTSLEEL YCSETLVKDL DALAELVNLE KLSCYKTEIA SIEPLANLSN LIELGINNSN VTDIRPLSKL TGIEYLRCNK TGIMNLEPLA NLAGLRELSI SRTRVESLEP LAELMELEEL DFSNTEVQSI LPLMQLEKLE KIELSAGTVP EKELERFIEL HPDCEILLTQ
|
| |