Gene Cag_1575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1575 
Symbol 
ID3747132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2058740 
End bp2060002 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content51% 
IMG OID637774115 
Producthypothetical protein 
Protein accessionYP_379873 
Protein GI78189535 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00249963 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACTC TTACGAAATT GCGCATTCTT TCGGGCGCGG CTCGTTACGA TGCTTCGTGT 
GCGTCGAGTG GGAGCAATCG TAGTGGCGCT TCGTGCGGTA TTGGTAATAC GTCGCAAAGT
GGTATTTGCC ATTCGTGGTC GGATGATGGG CGCTGTATTT CTCTCTTAAA AATTCTCCTC
TCAAACGATT GTTGCTACAA TTGCGCTTAT TGCGTGAATC GTGCCACCAA TCCCGTTGAG
CGTGCCTCGT TTACGGCGCG TGAAGTGGTT GACTTAACGC TTGACTTTTA CCGCCGTAAC
TATATTGAGG GATTGTTTTT AAGTTCGGCG GTTATGCAAA GCCCCGATGC CACTATGGAG
CGTATGGTGG CTGTGGCTGA AACGTTGCGG AGCGAGGAGC GTTTTGGCGG TTACATTCAC
CTGAAAATTA TTCCGGGTGC CAGCAGCGAG TTGGTGCGTA AAGCGGGGCT TTATGCTGAC
CGCATTAGTG TGAATATTGA GCTGCCGTCG CAAGTGTCGT TAGAACGTTT GGCGCCGCAA
AAGCATCGGG CGGCAATTTT AGAGCCGATG GCGCTCATTG GGCGCGAAAT TAACACAAGC
CTTGTGGAGC GTCAGCATAG TCATCGGGCA CCTCGTTTTG CGCCAGCAGG GCAGAGCACG
CAAATGATTA TTGGTGCTAC GCCCGAAAGC GATTTTCAAA TTTTGCGCCT TTCGCAAGGG
TTGTATAAAA AAATGAACCT CAAGCGGGTC TATTACTCGG CTTACGTGCC CGTTAGTGAG
GATAACCGTT TGCCCGTGCT TGCAGCGCCA CCGCTTTTGC GCGAACATCG GTTGTATCAA
GCCGATTGGT TGCTGCGCTT TTATGGCTTT TCGGCTGAAG AAATTTTATC GGAGGAGTTG
CCACATCTCG ATGAGCAATT CGATCCTAAA ACAGCGTGGG CGTTGCGTCA TCCCGAATTT
TTTCCCGTTG ATATTAATCG TGCCGATTAC GCCACGCTCT TGCGGGTGCC GGGCATTGGC
GTTACTTCCG CTAAACGCAT TGTTGCTGCT CGCCGCTTTT CGCTTATAAC GTTTGAAGGA
TTGAAAAAAA TTGGGGTGGT AATAAAGCGG GCGCGTTACT TTATTACCAT GCAAGGGCGC
CGTGTTGAGT GCACCGACTT TTCGCCAACG CTCATTCGTC GTCAGCTCCT TTTAAGCGAA
TCCACAGAAA AGCCCGCTTC ACGGCAGCTT GTGCTCCCAG GACTTGAACC CATCCTCGCA
TGA
 
Protein sequence
MDTLTKLRIL SGAARYDASC ASSGSNRSGA SCGIGNTSQS GICHSWSDDG RCISLLKILL 
SNDCCYNCAY CVNRATNPVE RASFTAREVV DLTLDFYRRN YIEGLFLSSA VMQSPDATME
RMVAVAETLR SEERFGGYIH LKIIPGASSE LVRKAGLYAD RISVNIELPS QVSLERLAPQ
KHRAAILEPM ALIGREINTS LVERQHSHRA PRFAPAGQST QMIIGATPES DFQILRLSQG
LYKKMNLKRV YYSAYVPVSE DNRLPVLAAP PLLREHRLYQ ADWLLRFYGF SAEEILSEEL
PHLDEQFDPK TAWALRHPEF FPVDINRADY ATLLRVPGIG VTSAKRIVAA RRFSLITFEG
LKKIGVVIKR ARYFITMQGR RVECTDFSPT LIRRQLLLSE STEKPASRQL VLPGLEPILA