Gene Cag_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0831 
Symbol 
ID3746830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1157520 
End bp1158569 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content45% 
IMG OID637773361 
ProductAsnC family regulatory protein 
Protein accessionYP_379140 
Protein GI78188802 
COG category[S] Function unknown 
COG ID[COG3177] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000889639 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAACA CATCAACCAT CCATATTACG CCAGAAATGT TGAGCTTAAT TGCTCACATT 
GATGAATTTA AAGGCGCATG GCGAGCCTTG GGAGTGCTTG CACCCGAACG TCTTTCGGCG
CTTCGCCGAG TGGCTACTAT TGAAAGCATT GGCTCCTCAA CCCGTATTGA AGGCAGTAAA
CTTTCTGACC AAGAAGTGGA ACGTTTGCTT TCTAACCTAA CCATCAACAC CTTTGAAACG
CGCGATGAAC AAGAAGTTGC GGGTTACGCC GAATTAATGG AATTGCTGTT TACCTCATGG
CAGTACATTC CCTTTAACGA AAATCACATC AAGCAATTCC ATCAACTTTT ACTAAGCCAC
AGCTCCAAAG ATGCTCGCCA TCGAGGCACC TACAAAACCA CATCAAACAG CGTTGCCGCA
TTTGACGAAA ATGGCAAGCA ACTTGGTATT GTCTTCCAAA CAGCTTCACC GTTCGATACA
CCATTTTTAA TGCAGGAATT AATTGCATGG GTAAACCAAG AGCGTGAAGC CAAGCAACTG
CATCCGCTCC TCATTATTGC AATTTTTGTG GTTGTGTTTT TGGAAATCCA CCCGTTTCAA
GATGGCAATG GAAGACTTAG TCGTGCATTA ACCACATTGC TTTTGCTACA AGCAGGATAT
GCTTATGTGC CATATAGCTC ATTGGAAAGC GTGATTGAAA GCAACAAAGA GGCGTACTAT
GTAGCATTGC GCCAAACGCA AGGTACCATC CGAAGCGAAG TTCCAAATTG GCAAGCGTGG
CTTCTCTTTT TTTTACGCTC ACTTGTAGAG CAGGTGCACC GCTTACAAAA CAAGATTGAG
CGGGAACATG TTGTACTGGC CGCACTACCA GAGCTTGCCT TGCAAATTGT TGAATTTGTG
CATCAACATG GGCGCATCAC CATAGGCGAA GCTGTAAAAC TTACGGATGC AAACCGTAAT
ACTTTAAAGG TTCACTTCCG TAAGTTAGTG GAACAGGGGT ATTTAAAACA GCAAGGAAGT
GGGCGTGGGG TTTGGTATGA AAGGGGATGA
 
Protein sequence
MLNTSTIHIT PEMLSLIAHI DEFKGAWRAL GVLAPERLSA LRRVATIESI GSSTRIEGSK 
LSDQEVERLL SNLTINTFET RDEQEVAGYA ELMELLFTSW QYIPFNENHI KQFHQLLLSH
SSKDARHRGT YKTTSNSVAA FDENGKQLGI VFQTASPFDT PFLMQELIAW VNQEREAKQL
HPLLIIAIFV VVFLEIHPFQ DGNGRLSRAL TTLLLLQAGY AYVPYSSLES VIESNKEAYY
VALRQTQGTI RSEVPNWQAW LLFFLRSLVE QVHRLQNKIE REHVVLAALP ELALQIVEFV
HQHGRITIGE AVKLTDANRN TLKVHFRKLV EQGYLKQQGS GRGVWYERG