Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0892 |
Symbol | |
ID | 3748082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1222701 |
End bp | 1224104 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637773423 |
Product | internalin-related protein |
Protein accession | YP_379200 |
Protein GI | 78188862 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACTTC GGCTTTTTTT CTCTATTTTT AAAGAACTTA AAAGCAGCGC GGTTCCTCAG AGTCCTCTTG TTCCTTTAAG TAGTGATGGA ATTTTTTGGT ACCAAAAAGC GCTACTTCCC ATTGCAATTA AGAAAATGGA AGTGCGCCAA AGTACAAACC AGCTCGTTAT CTCCCCAACC TTAAACGCAC TTTTAATTGT GGAGCAACAT ACCTACAACC AATGCCCAGT GTGCGGTTTT CCGCTTTCCA TGAATAGCGC TATTTGCCCT CGTTGTGGCA ACGATATTCT TGAAGATATT TCATCACTCG ATCAACAATC GTTGGAGAGG TATCATAAAC ATCTTGAAAA CAAAAAAGCA GAGTGGTACG CCCGTTGCTT AACCGATCAA ATTACTGGTG GCGACAACCC ACCCTTATCA GCCGAGCACC AAGAGTGCCC AGCAGGACGC CAAAAGCCTC ACGCTTTGTT TAATAGTGAT GACGAGTTGG CATTTTTTAC CTCATTAAAC CGTGCCGATA TTCTGCGCGA CACAAATTTG CGCAAAAAGT GGTGGCAAAG CATTACTGCC GATTGGCAAG ATGTGGTACG TTTCACCTTA AAAATTAATC ACGATCCTTC CGATAGCGAC TTACTTGCTT TTTTTGATAG CACCAATTTG CGTTGTGATG ACCGTCGCAT TCATAGCTTG CTGCCTATTC GCGTACTCGA AAAGCTTCAG CAACTTCGTT GCGATGAATC GCCCATTGAA AGCCTTGAGC CTCTTGCCCA CCTTACCTTG TTGCAGCGAC TTTATGCCTT TGATTGCGAC TTTACCTCAT TGGAACCGCT GCGTAACCTA ACGCATCTTA AACTCCTATG GATTTCAAGC ACCGAAATCA CATCGCTTGA ACCCATTAGT AACCTTATTA ATCTCGAAGA GCTTTATTGC TCCGAAACCG ACATTACCGA TTTAGAGCCA CTTCGGAAGC TTATCAATCT CGAAAAGCTA AGCTGCTACA AAACCAGCAT TACCTCCTTA GAGCCACTTG CTGAACTTGA AAATTTAATT GAGCTGGGCA TTAATCACTC CGATATTAAT GATTTAACCC CTCTTGCAGG GCTTATCAAT CTTGAATACT TGCGCTGTAG TAAAACCGCT ATTAGCAGCT TAGAACCGTT GCGCAACATG GTAGAGTTGC GGGAACTCAG CATTGCTCAT ACCAATGTAG ATTCGTTAGA AGGCTTGCAA GGGTTAGAAA ATCTTGAAGA GCTTGATATT ACGAACACCT TGGTAAGTTC TATTGAACCG CTTATGGGGT TGGAATACAT CGAAAAGCTT GAGCTTTCGG TTGGCACCAT TCCTGACGAA GAGCTTGAGC GCTTTGTAGA ATTACATCCC GATTGCAATG TTGTTGCAAA GTAG
|
Protein sequence | MALRLFFSIF KELKSSAVPQ SPLVPLSSDG IFWYQKALLP IAIKKMEVRQ STNQLVISPT LNALLIVEQH TYNQCPVCGF PLSMNSAICP RCGNDILEDI SSLDQQSLER YHKHLENKKA EWYARCLTDQ ITGGDNPPLS AEHQECPAGR QKPHALFNSD DELAFFTSLN RADILRDTNL RKKWWQSITA DWQDVVRFTL KINHDPSDSD LLAFFDSTNL RCDDRRIHSL LPIRVLEKLQ QLRCDESPIE SLEPLAHLTL LQRLYAFDCD FTSLEPLRNL THLKLLWISS TEITSLEPIS NLINLEELYC SETDITDLEP LRKLINLEKL SCYKTSITSL EPLAELENLI ELGINHSDIN DLTPLAGLIN LEYLRCSKTA ISSLEPLRNM VELRELSIAH TNVDSLEGLQ GLENLEELDI TNTLVSSIEP LMGLEYIEKL ELSVGTIPDE ELERFVELHP DCNVVAK
|
| |