Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1739 |
Symbol | |
ID | 6354567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 1910399 |
End bp | 1911277 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642669343 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_001943759 |
Protein GI | 189347230 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAATC CGAATGCACT GAAATGGAGC GATCTGAGAA CCGGCATCTT TTTTCTGCTC GGACTGGGCT TTGCAGCATA TCTCGGTCTG GTGGTCGGCA AAAACAGCAG TCTCTTCACC GGCGTAACGA CCATCAAGGT TCTTTCCCAG GATGTACAGG GACTTGCCGA AAACAACTTC GTATCCGTTT CAGGAAAAAA GATCGGTACG GTTTCAAAAC TTGACTTTGT CACCAACAAC GACTCGCTTT ACGTTGTCGC CGAACTCAGA CTGCGCAACG AGTTCGCGGT GCTCGTCACC AAAGACGCCA AAGCCACCAT CCGCTCGCTG GGCGTGCTCG GCGACAAATA TGTCGATATC ATAACAGGAA AAGGTCCGCA GGTAAAAAAC GGAGATTTCA TCGCTCTTGA TTCAGAAGAC GGCATGGCGG AACTTACCGG CGGAGCCAAC GAAGCCCTGG CGAAAATCAA CGAACTGCTC GACAGGCTCA ACAACGGCAA AGGTGTTGCC GGGAGGCTGG TTTCAGACGA AAAAATGGGT GCGGAACTTG CCGAAACCGT CACCAGCCTG AAAGCCACCT CCGCCGAACT ATCGTCCCTT TCGAAAAAAG CGTCCAGCGG CAACGGACTG CTTCCGAAGC TCATCAACGA CGGAGAACTT GCGAGGAATA CCCAGGAGAC AGTCTCCCGC CTCAACAAGG CTGCCGAGAA AACCGAGGCC CTTATGGCAA AACTGGAGAG CGACCAGGGC ACTTTCGGCC AGCTGCACTC CAATCCGGCC CTTTACAACA ACCTGAACGA GACGCTTGCC TCACTCGATT CGGTGCTGGT CGATCTGAAA AAGAACCCGA AACGGTACGT CAAGTTCTCT GTCTTCTGA
|
Protein sequence | MRNPNALKWS DLRTGIFFLL GLGFAAYLGL VVGKNSSLFT GVTTIKVLSQ DVQGLAENNF VSVSGKKIGT VSKLDFVTNN DSLYVVAELR LRNEFAVLVT KDAKATIRSL GVLGDKYVDI ITGKGPQVKN GDFIALDSED GMAELTGGAN EALAKINELL DRLNNGKGVA GRLVSDEKMG AELAETVTSL KATSAELSSL SKKASSGNGL LPKLINDGEL ARNTQETVSR LNKAAEKTEA LMAKLESDQG TFGQLHSNPA LYNNLNETLA SLDSVLVDLK KNPKRYVKFS VF
|
| |