Gene Cag_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1938 
Symbol 
ID3746698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2469736 
End bp2471013 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content42% 
IMG OID637774473 
ProductSel1 repeat-containing protein 
Protein accessionYP_380229 
Protein GI78189891 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTTA TGAAAAAATT TATCACAAGC ATTGTTATAG CAAGTTTAAC GCTGTTGGCG 
ATTAATGGAT TTTGTGAAAC ACCCTCACAA AAACAAATAT CTCAATGGCA ACAAGCTGCT
GCGCAAGGTA ACTCAGAAGC ACAATTAAAT CTTGGTTATG CCTATGATCA TGGAGAAGGA
GTGAAGCAAG ATTATGCGGA GGCGATAAAA TGGTATCGGT TGTCGGCAGC TCAAGGTGAT
GTTAAAGCAC AATTTAATCT TGGGGTGATG TATTATAACG GTGAAGGAGT AAAGCAAGAT
TATGCAGAGG CGATAAAATG GTTTCGTTTA TTAGCAACTC AAGGTGATGC AATAGCACAA
TTTAATCTTG GGGTGATGTA TTATAACGGT GAAGGCGTGA AGCAAGATTA TACAGATGCG
TTGAAATGGT TTCAGTTATC AGCAGCTCAA GGAAATGCAA TGGCACAAAA CAATCTTGGT
GTGATGTATG CTAAAGGTGA AGGCGTGCAG CAAGATTATG CAGAAGCGTT GAAATGGCAT
CGTTTATCAG CAGCACAAGG CAATGCAATG GCACAAAACA ATCTTGGAGC GATGTATTAT
AAGGGTGAAG GAGTCGAGCA AGATTATGTG GAGGCACTAA AATGGTATCG GTTATCGGCA
GCACAAGGAG ATGCGGTTGC GCAATGGATT CTCGGTTTGA TGTACTATGA AGGTCAAGGA
GTAAGGCAAG ATTACGGAGA AGCGATAAAA TGGTATCGTT TATCAGCGGC TCAAGAAGAT
GCGAAAGCGC AATATAACCT TGGCTTGATG TACTACAATG GTGAAGGTGT GAAGCAAGAT
TATGCCGAAG CGTTGAAATG GCATCGTTTA TCAGCAGCAC AAGGCAATGC AATGGCACAA
AACAATCTTG GAGCGATGTA TGCTAAAGGT GAGGGCGTGC AGCAAGATTA TGCAGAAGCG
TTGAAATGGC ATCGTTTATC AGCAGCTCAA GGTGATGCCA CAGCACAAGG TATTCTCGGT
TTGATGTACT GTGAAGGTTA TGGAGTAAGG CAAAATTACG GAGAAGCGCT AAAATGGTAT
CGTTTATCGG CAGCTCAAGG AAATGCAGGT GCACAATACA ATCTTGGTCT GATGTATTAT
AACGGTACAG GTGTTAGGCA GAGTAAAGCA ATTGCAAAAG AGTGGTTTGG CAAAGCTTGT
GATAATGGTT TCCAAGATGG ATGTGATGCA TATCGGGAGT TAAATGAAGC TGGGGCAAAA
ACTAATAGGA GCCGGTAA
 
Protein sequence
MNVMKKFITS IVIASLTLLA INGFCETPSQ KQISQWQQAA AQGNSEAQLN LGYAYDHGEG 
VKQDYAEAIK WYRLSAAQGD VKAQFNLGVM YYNGEGVKQD YAEAIKWFRL LATQGDAIAQ
FNLGVMYYNG EGVKQDYTDA LKWFQLSAAQ GNAMAQNNLG VMYAKGEGVQ QDYAEALKWH
RLSAAQGNAM AQNNLGAMYY KGEGVEQDYV EALKWYRLSA AQGDAVAQWI LGLMYYEGQG
VRQDYGEAIK WYRLSAAQED AKAQYNLGLM YYNGEGVKQD YAEALKWHRL SAAQGNAMAQ
NNLGAMYAKG EGVQQDYAEA LKWHRLSAAQ GDATAQGILG LMYCEGYGVR QNYGEALKWY
RLSAAQGNAG AQYNLGLMYY NGTGVRQSKA IAKEWFGKAC DNGFQDGCDA YRELNEAGAK
TNRSR