Gene Cag_1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1146 
Symbol 
ID3747864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1541849 
End bp1543129 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content44% 
IMG OID637773679 
Producthypothetical protein 
Protein accessionYP_379451 
Protein GI78189113 
COG category 
COG ID 
TIGRFAM ID[TIGR01053] zinc finger domain, LSD1 subclass 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00972968 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACCAC TACTTGTTTT TATAACATCA CTTATTATCC TTGCCGCCCT TGCAACAGCG 
CTTCCTAATC TGCTTATTAA TCCGGGAAAA TTAACAAAAG GGCATAAACA CCTCGACACG
AATTGCCTTG CCTGCCATAC CCCATTTCAA AGCGTTAGCT CGGTGCAGTG CATTAGTTGC
CATAAGCCAA ACGATATTGG CGTAAAAACA GTAGCTGGTG TTGTTCTCCC TAAAAACAGC
AAGAAAGTTC CGTTCCATAA AGGCGTTGCG ACTAACTCCT GCATTGAATG CCATAGTGAT
CATAAAGGCA GAATTCCAAC GAAAGCACTG AAGCCATTTA TGCACGCTTC ACTGCCGCAA
TCCATTAAAA ACAATTGCAT AAGTTGCCAC ACACAACAAC AACCTGTAGA TGCTTTGCAT
AAAAAAGTTT CATCAAATTG TGCTGAGTGT CATACAACAA AACAGTGGAA ACCGGCCACT
TTCGATCATA AAAAAATTAC TGCTTCTGTT GCAAAAGAGT GCATCAGTTG CCATAAGAGC
GACTTACCAA ATGATAAGCT CCATGCTAAT GTCTCCTCAA ATTGTGCAGA GTGCCATCGC
ACAACAAAAT GGAAGCCAGC TACGTTTGAC CACAAGAATC TTGCTGCCTT AGGTGAAAAA
TCGTGCATTG CTTGCCATAA GAGCGACTTG CCAAACGATA AGCTCCATGC TAATGTCTCC
TCGAATTGTG CAGAGTGCCA TCGCACAACA AAATGGAAAC CGGCTACGTT TGACCACAAG
AATCTTGCCA CTTTAGGTGG CAAATCCTGC ATTGCCTGCC ATAAGAGCGA CTTGCCAAAC
GATAAGCTCC ATGCTAATGT CTCCTCAAAT TGCGCAGAGT GCCATCGAAC AACAAAATGG
AAGCCAGCTA CGTTTGACCA CAAGAATCTT GCCACTTTAG GTGGAAAATC GTGCATTGCT
TGCCATAAGA GCGACTTACC AAACGATAAG CTCCATGCTA ATGTCTCCTC AAATTGCGCA
GAGTGCCATC GCACAACAAA ATGGAAACCG GCTACGTTTG ACCACAACCG ATACTTCAGG
CTTGATAGCG ACCATCGTGT AAGTTGTGCC ACATGCCATA CCGAACAAAA CAATTACAAA
AGGTACACCT GTTATGGCTG CCATGAACAT TCGCAAGCAC GTATTGCAGC CGAGCATATC
AAAGAAGGCA TTGCAAACTA CAACAACTGC ATGAAGTGCC ATCGCAATGG CAAAGCTGAA
GAGAAAGGAG ATGATGATTG A
 
Protein sequence
MKPLLVFITS LIILAALATA LPNLLINPGK LTKGHKHLDT NCLACHTPFQ SVSSVQCISC 
HKPNDIGVKT VAGVVLPKNS KKVPFHKGVA TNSCIECHSD HKGRIPTKAL KPFMHASLPQ
SIKNNCISCH TQQQPVDALH KKVSSNCAEC HTTKQWKPAT FDHKKITASV AKECISCHKS
DLPNDKLHAN VSSNCAECHR TTKWKPATFD HKNLAALGEK SCIACHKSDL PNDKLHANVS
SNCAECHRTT KWKPATFDHK NLATLGGKSC IACHKSDLPN DKLHANVSSN CAECHRTTKW
KPATFDHKNL ATLGGKSCIA CHKSDLPNDK LHANVSSNCA ECHRTTKWKP ATFDHNRYFR
LDSDHRVSCA TCHTEQNNYK RYTCYGCHEH SQARIAAEHI KEGIANYNNC MKCHRNGKAE
EKGDDD