Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1202 |
Symbol | |
ID | 6374879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1300067 |
End bp | 1301266 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642683702 |
Product | internalin-related protein |
Protein accession | YP_001959617 |
Protein GI | 189500147 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.83099 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAACAGA TCACCTATAA CAATTGCCCT GTCTGCAGCT ACCCCCTGTC TGCAGAAAAC GCAGTCTGTC CCCGTTGCGG CAATGACATT CTCGAAGATA TTTCCTCGCT TGATGAACAA ACTCAGGAAA AGCACATAAG GATCATAGAC GAGAAAAAAG CCGCATGGTA CACACGATGT GTCGCCGACA AGCTGAATTG CTGTGAAATG CAGCCGCCGA GTGACGAACA AGCAGAGAGT TCCTGTCATG TTTCCATGGA AAAAGGTGGC AAGCTGGTTG ATTCACCTGT TTTTTCACTT CCGCGCAAAC AACTGCTTGA AGACCGCTCG AAAAGGAGTG AGTGGTGGAA CTCACTGAAT GCTGACTGGA AAGAAGTTGT TAAAAACACC ATTAAGTCAT CCAGCGACCC TTCAGACCAG GAGTTGCTGG ATTTCTTCAC TGTCAAACAC CTGAGATGTG ATAACCGCAG AATTCACAGC CTTGCCCCTG TCAGGGTGCT CGAAAAACTG CAGCAACTGC GGTGTGACGA ATCTCCCATT GAAAACCTTG AACCGCTCAG GGAAATCACA TCACTGCAAA GGCTCTACGC CTTCGACTGT GATTTCTCCT CTCTGGAACC CCTGAGGGGC TTGCTGAGTC TGAAGCTGCT CTGGATATCG AGCACTGAAG TCAGTGACCT TGATCCGGTA AAGCATATGA TCAATCTTGA AGAACTCTAT TGTTCTGAAA CACCTGTCAG CGACCTTTCC CCTCTATCAG AATTGAGCAA GCTTGAAAAA ATAAGCTGCT ACAAAACCGA GATTTCATCA CTCAAACCAC TTGAAAAACT TGAAAATCTC ATAGAGCTTG GCTTCAACAG TACCCTGATT GACGACCTCT CTCCTTTAGC CGACCTTGAA AATCTCGAAT ATCTGCGTTT CAGTCGCACC GATATCAGTT CTCTTGAACC GCTTTCATCG CTGATAAACC TGAGGGAACT GAGCTTTAAT GAAACCTTCG TTGCGTCACT CGAGCCGCTT GCCGGATTGT CTGAACTTGA AGAGATCTCC TTCGCGAACA CAAACGTCAC CACGATAGCT CCGCTGATGC ACCTCACCTA CCTTGAAAAG ATAGAGCTGA CAGCAGGCCA GATAAGGAAT GAAGAACTTG AACAGTTCCT CGAGCTTCAC CCTGATTGCG AAATACTCTT GAAAAAATAA
|
Protein sequence | MKQITYNNCP VCSYPLSAEN AVCPRCGNDI LEDISSLDEQ TQEKHIRIID EKKAAWYTRC VADKLNCCEM QPPSDEQAES SCHVSMEKGG KLVDSPVFSL PRKQLLEDRS KRSEWWNSLN ADWKEVVKNT IKSSSDPSDQ ELLDFFTVKH LRCDNRRIHS LAPVRVLEKL QQLRCDESPI ENLEPLREIT SLQRLYAFDC DFSSLEPLRG LLSLKLLWIS STEVSDLDPV KHMINLEELY CSETPVSDLS PLSELSKLEK ISCYKTEISS LKPLEKLENL IELGFNSTLI DDLSPLADLE NLEYLRFSRT DISSLEPLSS LINLRELSFN ETFVASLEPL AGLSELEEIS FANTNVTTIA PLMHLTYLEK IELTAGQIRN EELEQFLELH PDCEILLKK
|
| |