Gene Information |
Locus tag | Cphamn1_2235 |
Symbol | |
ID | 6375929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2420482 |
End bp | 2421801 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642684722 |
Product | protein of unknown function DUF21 |
Protein accession | YP_001960621 |
Protein GI | 189501151 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
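The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cphamn1_2235.
length = gene_length_bp(2420482, 2421801)
print(length)                     # 1320
print(protein_length_aa(length))  # 439
```

Both results agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields of the record.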
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCCG ATATAGTAGA ACTGCTTATA TTATTCTGCC TGATACTGGC TAACGGATTC TTTTCGATGG CAGAGTTTGC GATCATTTCT TCAAGTGAAG CCAAACTGCA TGAATTACGT GACGCGGGTA TAGCCGGTGC GTCCATGGTT ATCGAGCTGC TTGAGAAGCC CGGAAGATTC CTTTCCGCCA TACAGGTAGG CATCACCCTT ATCGCCACCC TCGCGGGAGC ATTCAGCGGT ATCAGCTTCT CCGAACCTGT CGCGAAAATC ATCAGAGAGA TCGAACCCAT TGCGCCATAC AGCAAAGAAC TGGCTCTCGG CCTGGTTGTT TTGGGGGTCA CCTATGTCAC GCTCATCATC GGCGAACTCG CTCCGAAAAA AATAGCACTG CAGCATCCGG AAAAAATAGC GATCAAAATC GGACGCAGTA TCGACATGAT CTGCAGAATA AGCTCGCCGA TAGTTCACCT TATCAACGGC TCAACCAACA TCGTCCTGAA AATCATAGGC ATCAGAAATA CTGAAAAGCC TCTTGTGAGC GATCAGGAGG TCATGCTCAT GATAAAACAG GGTGCAAAAA AAGGCGTTTT TGAATCGGTT GAATACGAGA TGATTTCAAG AATATTCCGT ATGAGCGACA AGCGGGCAAG TGCAATGATG ACACCGAAAA GCGAGATCGA GTGGCTCGAC CTCAATGCGC CGGATGACTC TTTGAGAGCC AAACTGCTGG CCAGCGGCCG CTCACACTTT CCTGTTTCAG AAGGAGGAGC TGACCACCTG AGAGGCGTTG TACGCTCACT TGACCTGGTC AGCAAGCAAC TGCTCGAACC TGGCAATCTG AAAGCGTCCA TACGCAGTGC GATGAAACCA CCTCTTTTTG TACCGGAATC CGTTCCCGCA TTTCAGGTAC TGGAACTTTT CAAGGAAAAC AGAGCCCATC TCGCGCTGGT CATCGACGAA CACGGCTCGG TACAGGGCGC AATCTCACTC ACAGACGTAC TTGAAAGTAT CGTGGGAGAT ATTCCCGCTG ATGATCTTGA GGGAACAAAA AAAATCGTAC AAAGAAGTGA GCGAACCTGG ATTATTGACG GCCTGCTCCC TGTCGATGAG TTTATCTCTG AATTTGATCT TGACAACTTC CTCGACGAGG ATGATCCCCG ATACGATACC ATGGGAGGAT TTCTCATGAC CAAGCTGGAG AAGATGCCTT CAGTCATGGA TACGCTCGAA TGGCAGAACA TGCTGTTCAA GGTGATCAAA ATGAACGGAC AGCGGGTAGG AAAAATACTG GTGATATTCA CTACCGAAAG TTCGCACTGA
|
Protein sequence | MNSDIVELLI LFCLILANGF FSMAEFAIIS SSEAKLHELR DAGIAGASMV IELLEKPGRF LSAIQVGITL IATLAGAFSG ISFSEPVAKI IREIEPIAPY SKELALGLVV LGVTYVTLII GELAPKKIAL QHPEKIAIKI GRSIDMICRI SSPIVHLING STNIVLKIIG IRNTEKPLVS DQEVMLMIKQ GAKKGVFESV EYEMISRIFR MSDKRASAMM TPKSEIEWLD LNAPDDSLRA KLLASGRSHF PVSEGGADHL RGVVRSLDLV SKQLLEPGNL KASIRSAMKP PLFVPESVPA FQVLELFKEN RAHLALVIDE HGSVQGAISL TDVLESIVGD IPADDLEGTK KIVQRSERTW IIDGLLPVDE FISEFDLDNF LDEDDPRYDT MGGFLMTKLE KMPSVMDTLE WQNMLFKVIK MNGQRVGKIL VIFTTESSH
|
| |
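The gene and protein sequences above can be checked against each other, and the GC content recomputed, with a short script. This is a minimal sketch rather than the database's own method. It assumes the sequences are pasted as shown, with spaces between 10-character blocks, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted). Alternative start codons are not handled here, and the names `translate` and `gc_percent` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, dropping one trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
fragment = "ATGAACTCCG ATATAGTAGA ACTGCTTATA"
print(translate(fragment))  # MNSDIVELLI
```

The translated fragment matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.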