Gene Cag_1932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1932 
Symbol 
ID3747307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2463544 
End bp2464914 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content47% 
IMG OID637774467 
ProductCBS 
Protein accessionYP_380223 
Protein GI78189885 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0017665 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATAT TTCTACTTTT TGTTCTCATA CTTGTCAATG GCGCGTTTGC CATGTCGGAA 
ATTGCGTTAG TAACAGCTAA ACGTTCTCGA CTTTCGCGCC TTGCGGACGA TGGGGATAAA
TCTGCCACTA CGGCAATGAA GTTGGGGGAA GATTCCACCA GCTTTCTTTC CACCATTCAA
ATTGGCATCA CCTCCATTGG TATTCTCAAC GGTATTGTTG GAGAAGGGGC GTTAGCCGTT
CCCTTTTCAC TATTCATTCA TTCGGCAACA GGCATTGAGT TAGAAACCGC TCAGCTTATT
GCTACCGTTG TTGTGGTGCT TGGCATTACC TACGTTACCA TTGTGGTGGG TGAATTGGTA
CCAAAACGGC TTGGGCAGCT TAATCCCGAA CAAATTGCCT GTTTGGTTGC TCGCCCCATG
CAAATACTTG CCACAATTAC TCGTCCTTTT GGGCGCTTGC TCTCCTTTTC AACCAACACG
TTGCTTCGTT TAATGGGGGT TAAACCGCAA ATTACCCCAA GCGTTACTGA AGAGGAAATT
CATGCTATGC TTGAAGAGGG TTCCGAAGCA GGGGTGATTG AACAGCAAGA GCGCGATATG
GTGCGCAATG TGTTTCGCCT TGACGACCGC CAGCTTGGTT CGCTCATGGT ACCTCGTGCC
GATATTGTTT TTCTTGATGT AACTCAACCG CTTGAGGAGA ATATTTGCCG TGTAACGGAG
TCGGAGCATT CGCGTTTTCC TGTATGCAAC GGTAATCTTC AATCGCTACT TGGCGTGGTG
AATGCAAAGC AGTTGTTGCT TAAAACCTTG CGCGGCGGTT TAACCGAATT TGCAACACTC
TTGCAGCCAT GTGTGTATGT GCCCGAAACG CTTACGGGCA TGGAATTGCT TGACCACTTT
AGAACTTCTG GTACGCAAAT GGTTTTTGTG GTTGATGAGT ATGGTGAAAT TCAAGGCTTA
GTTACCTTAC AAGATTTGCT TGAAGCTGTA ACGGGTGAAT TTGTGCCGCG CAACCTTGAA
GATTCATGGG CAGTTGAGCG TGCCGATGGT TCGTGGTTGC TTGATGGCTT AATTCCCGTG
CCTGAATTAA AAGATACGCT CAAGCTTAAA GAGGTGCCCG ATGAGGATAA GGGGCTTTAC
CACACCTTAA GTGGAATGAT TATGTGGTTG CTTGGTAGAA TGCCGCATAC GGGCGATGTG
CTTGTTTGGG AAGAGTGGAA TTTGGAAATT GTTGACCTTG ACGGGCAGCG CATTGATAAA
GTGCTTGCTT CACCACTCAA CAATGCACCA AAAGCATCTC AAAAAGAGGA AAAGCCGCCC
GTCAAATCAG ACGATAATGC GGCTTTGTGT TCCATACGCC CAACGCCGTA A
 
Protein sequence
MEIFLLFVLI LVNGAFAMSE IALVTAKRSR LSRLADDGDK SATTAMKLGE DSTSFLSTIQ 
IGITSIGILN GIVGEGALAV PFSLFIHSAT GIELETAQLI ATVVVVLGIT YVTIVVGELV
PKRLGQLNPE QIACLVARPM QILATITRPF GRLLSFSTNT LLRLMGVKPQ ITPSVTEEEI
HAMLEEGSEA GVIEQQERDM VRNVFRLDDR QLGSLMVPRA DIVFLDVTQP LEENICRVTE
SEHSRFPVCN GNLQSLLGVV NAKQLLLKTL RGGLTEFATL LQPCVYVPET LTGMELLDHF
RTSGTQMVFV VDEYGEIQGL VTLQDLLEAV TGEFVPRNLE DSWAVERADG SWLLDGLIPV
PELKDTLKLK EVPDEDKGLY HTLSGMIMWL LGRMPHTGDV LVWEEWNLEI VDLDGQRIDK
VLASPLNNAP KASQKEEKPP VKSDDNAALC SIRPTP