Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1146 |
Symbol | |
ID | 3747864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1541849 |
End bp | 1543129 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637773679 |
Product | hypothetical protein |
Protein accession | YP_379451 |
Protein GI | 78189113 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01053] zinc finger domain, LSD1 subclass |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00972968 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACCAC TACTTGTTTT TATAACATCA CTTATTATCC TTGCCGCCCT TGCAACAGCG CTTCCTAATC TGCTTATTAA TCCGGGAAAA TTAACAAAAG GGCATAAACA CCTCGACACG AATTGCCTTG CCTGCCATAC CCCATTTCAA AGCGTTAGCT CGGTGCAGTG CATTAGTTGC CATAAGCCAA ACGATATTGG CGTAAAAACA GTAGCTGGTG TTGTTCTCCC TAAAAACAGC AAGAAAGTTC CGTTCCATAA AGGCGTTGCG ACTAACTCCT GCATTGAATG CCATAGTGAT CATAAAGGCA GAATTCCAAC GAAAGCACTG AAGCCATTTA TGCACGCTTC ACTGCCGCAA TCCATTAAAA ACAATTGCAT AAGTTGCCAC ACACAACAAC AACCTGTAGA TGCTTTGCAT AAAAAAGTTT CATCAAATTG TGCTGAGTGT CATACAACAA AACAGTGGAA ACCGGCCACT TTCGATCATA AAAAAATTAC TGCTTCTGTT GCAAAAGAGT GCATCAGTTG CCATAAGAGC GACTTACCAA ATGATAAGCT CCATGCTAAT GTCTCCTCAA ATTGTGCAGA GTGCCATCGC ACAACAAAAT GGAAGCCAGC TACGTTTGAC CACAAGAATC TTGCTGCCTT AGGTGAAAAA TCGTGCATTG CTTGCCATAA GAGCGACTTG CCAAACGATA AGCTCCATGC TAATGTCTCC TCGAATTGTG CAGAGTGCCA TCGCACAACA AAATGGAAAC CGGCTACGTT TGACCACAAG AATCTTGCCA CTTTAGGTGG CAAATCCTGC ATTGCCTGCC ATAAGAGCGA CTTGCCAAAC GATAAGCTCC ATGCTAATGT CTCCTCAAAT TGCGCAGAGT GCCATCGAAC AACAAAATGG AAGCCAGCTA CGTTTGACCA CAAGAATCTT GCCACTTTAG GTGGAAAATC GTGCATTGCT TGCCATAAGA GCGACTTACC AAACGATAAG CTCCATGCTA ATGTCTCCTC AAATTGCGCA GAGTGCCATC GCACAACAAA ATGGAAACCG GCTACGTTTG ACCACAACCG ATACTTCAGG CTTGATAGCG ACCATCGTGT AAGTTGTGCC ACATGCCATA CCGAACAAAA CAATTACAAA AGGTACACCT GTTATGGCTG CCATGAACAT TCGCAAGCAC GTATTGCAGC CGAGCATATC AAAGAAGGCA TTGCAAACTA CAACAACTGC ATGAAGTGCC ATCGCAATGG CAAAGCTGAA GAGAAAGGAG ATGATGATTG A
|
Protein sequence | MKPLLVFITS LIILAALATA LPNLLINPGK LTKGHKHLDT NCLACHTPFQ SVSSVQCISC HKPNDIGVKT VAGVVLPKNS KKVPFHKGVA TNSCIECHSD HKGRIPTKAL KPFMHASLPQ SIKNNCISCH TQQQPVDALH KKVSSNCAEC HTTKQWKPAT FDHKKITASV AKECISCHKS DLPNDKLHAN VSSNCAECHR TTKWKPATFD HKNLAALGEK SCIACHKSDL PNDKLHANVS SNCAECHRTT KWKPATFDHK NLATLGGKSC IACHKSDLPN DKLHANVSSN CAECHRTTKW KPATFDHKNL ATLGGKSCIA CHKSDLPNDK LHANVSSNCA ECHRTTKWKP ATFDHNRYFR LDSDHRVSCA TCHTEQNNYK RYTCYGCHEH SQARIAAEHI KEGIANYNNC MKCHRNGKAE EKGDDD
|
| |