Gene Cag_1221 details


Gene Information

Locus tag: Cag_1221
Symbol:
ID: 3748255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1619191
End bp: 1620375
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 44%
IMG OID: 637773755
Product: putative transcriptional regulator
Protein accession: YP_379526
Protein GI: 78189188
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
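
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that
    includes the terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(1619191, 1620375))  # (1185, 394)
```

Under these assumptions the result agrees with the Gene Length (1185 bp) and Protein Length (394 aa) fields.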


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACGG AAACTGAAGT GCAATTGCTA CTGAGCAATA TGGAAGCCGA TACCATTGAA 
CGTACAACGG CTGTTGCTGA TACCGATAAG TTTGGGCAAG CAATTTGTGC ATTTGCAAAC
GACCTACCTA ACCATCGCGC CCCAGGTTAT TTGCTTATTG GAGTAAAAGA TAATGGCGAA
CTTTCAGGTT TAACGGTTAC CGATGAGCTG TTAAAAAATC TTGGTGGTAT TCGTTCACAA
GGCAATGTAC TCCCGCAACC CTATATGAAT ATTGCAAAAT TTTCTTTTGC AAGTGGAGAT
GTTGCTGTTG TTGAAGTCTA TCCCTCTGAT TTACCACCAG TTCGTTATAA AGGACGAGTA
TATATTCGTG TTGGACCACG CAAAGGCATT GCGAACGAAC AAGAAGAGCG AGTGTTAACA
GAACGCCGTA TTGCGCTTGC CCGCACATTT GATGCGCGCC CGTGCGGCGA AGCAACCCTA
AGCAATATTG CACTCGGGCA ATTCGATGCC TACCGCCGTG AAGTTATTGA TGCTGAAACT
ATTGCCGCGA ACAATCGTTC AATTGAGCTA CAACTCGCTT CACTGCGCCT ATTCGATCTA
AAATACAACA CTCCAACTCA TGCAGGAATT TTGCTCTTTG GCAAAAATCC CCGCTATTTT
TTGCATGGCG CTTATATCCA ATATTTGCGT TTTCCCAGCA CCGATATAAC CGATATTCCT
TTAGATCAAG CAGAAATTTC AGGCGACCTT TATGTTGTGC TGCGTGAACT TGAAATGCGC
GTTAAGCTGC TGATTCAAAC CTCCATGCGT CAAGTAAGTA CGCTGAAGGA GCAATTACTT
CCTGATTATC CCGAATGGGC TGTTCGAGAA TTGCTGATGA ATGCCGTTAT GCACCGCAAT
TATGAAAGCA ACACGCCTAT CCGATTTTAT GCCTTTAGCG ATCATATTGA AATCCAAAAT
TCGGGGGGAT TGTATGGGGA AGCAACCCCG GAAAACTTTC CCACCTGCAA TAGTTATCGT
AATCCAGTCA TTGCAGAAGC GATGAAATCG CTTGGTTTTG TCAATCGCTT TGGTTATGGC
GTTCAACGCG CCCAAGCGCT ACTTGCACAA AATGGCAATC CACCAGCAAC ATTTGAGTTT
AATGAGCACT CGGTACTGGT AAAAGTTTGG AAACGAATGA AGTGA
 
Protein sequence
MITETEVQLL LSNMEADTIE RTTAVADTDK FGQAICAFAN DLPNHRAPGY LLIGVKDNGE 
LSGLTVTDEL LKNLGGIRSQ GNVLPQPYMN IAKFSFASGD VAVVEVYPSD LPPVRYKGRV
YIRVGPRKGI ANEQEERVLT ERRIALARTF DARPCGEATL SNIALGQFDA YRREVIDAET
IAANNRSIEL QLASLRLFDL KYNTPTHAGI LLFGKNPRYF LHGAYIQYLR FPSTDITDIP
LDQAEISGDL YVVLRELEMR VKLLIQTSMR QVSTLKEQLL PDYPEWAVRE LLMNAVMHRN
YESNTPIRFY AFSDHIEIQN SGGLYGEATP ENFPTCNSYR NPVIAEAMKS LGFVNRFGYG
VQRAQALLAQ NGNPPATFEF NEHSVLVKVW KRMK
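
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a listing could be cleaned, checked for GC content, and translated. The helper names (clean_sequence, gc_percent, translate) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acid to each codon and differs only in which start codons it permits. Only the first line of the gene sequence is used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Map every codon (in TCAG order) to its one-letter amino acid; '*' is stop.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGATTACGG AAACTGAAGT GCAATTGCTA CTGAGCAATA TGGAAGCCGA TACCATTGAA"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # 40.0
print(translate(seq))   # MITETEVQLLLSNMEADTIE
```

The translated fragment matches the first twenty residues of the protein sequence listed above. The GC value here covers only the first 60 bases, so it is not expected to equal the whole-gene figure reported in the record.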