Gene Cag_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0101 
Symbol 
ID3747589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp115674 
End bp116975 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content44% 
IMG OID637772627 
Producthypothetical protein 
Protein accessionYP_378422 
Protein GI78188084 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.292641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACGG CATTTGTAAA AATATGGGGC GAATTAGTGG GAGCGGTAGC TTGGGATGAT 
GCTACCGGCT ATGCCACGTT TGAATATGAT GCCAAATTCA AGAGTAAAGG CTGGGAATTG
GCTCCCTTAC AAATACCGGT AAATGCAACC AAAAGCAACT TTAGTTTTCC TGCCCTGCGT
AAAAAGGCGG ATCCTGCTTT AGATACCTTC AAAGGCTTAC CCGGCTTATT AGCCGATATG
CTGCCCGATC GTTACGGAAA TGAGCTCATC AACTTGTGGT TGGCTCAAAA GGGTCGTCCG
TTAGACAGCA TGAATCCTGT AGAAACTTTG TGCTTCATAG GCACTCGGGG AATGGGTGCC
TTGGAGTTTG AACCCACCAC CTTAAAGGAA AGCAAAAAAG CCTTTTCGCT GGAAATTGAT
AGCTTGGTTG AGATAACTCA AAAAATGCTC ACCAAAAAAG AAGCATTCGT AACCAACCTG
CAGGAAAACG AAGAAAAAGC CATTCTTGAA ATACTACGCA TTGGAACATC TGCCGGTGGT
GCTCGGCCTA AGGCAGTGAT TGCTTACAAC GAAAGAACAG GTGAAGTACG ATCTGGTCAA
ACCAATGCGC CACAGGGGTT TGAGCATTGG CTGCTAAAGT TGGATGGGGT GAGTGAGGTG
CAGTTGGGCG CAAGTCATGG GTATGGCCGG GTGGAAATGG CGTACTACAA CATGGCTGTA
GCTTGTGGCA TTCAGATAAT GCCTTCCAGA TTATTGGAAG AAAACGGCAG GGCACATTTT
ATGACCAAGC GTTTTGACCG TGAAGGCGGT GCAGCCAAAC ACCATATTCA AACCTTTTGT
GCCATGAAGC ACTTTGATTA CAATCTTGTA ACTAATTTTA GTTACGAGCA GTTGTTTCAA
ACGATGCGGG AACTAAAGCT ATCCTATCCG GATGCTGAGC AGTTGTTTCG CAGGATGGTA
TTCAATGTAG TAGCCCGTAA CTGCGATGAC CATACAAAGA ACTTCGCCTT CCGGTTAAAA
AAGGATGGAA AATGGGAACT GGCTCCGGCC TATGATGTTT GCCATGCCTA TCAACCCAAA
CATCAATGGG TAAGTCAACA TGCTTTAAGC ATCAATGGCA AACGAACTAA TATTACTAAA
GACGATTTGC TCACCATTGG CAAATCCATC AAAAATAAAA AGGCTGCAGA AACCATTGAG
GAAATCAGTA ACACAATAAG CCAATGGAAA ACCTTTGCCG ATGAAGTAAA GGTGTTACCC
AAACTGCGTG ATGAAATAGC CGCTACATTG ATTCGATTAT AA
 
Protein sequence
MKTAFVKIWG ELVGAVAWDD ATGYATFEYD AKFKSKGWEL APLQIPVNAT KSNFSFPALR 
KKADPALDTF KGLPGLLADM LPDRYGNELI NLWLAQKGRP LDSMNPVETL CFIGTRGMGA
LEFEPTTLKE SKKAFSLEID SLVEITQKML TKKEAFVTNL QENEEKAILE ILRIGTSAGG
ARPKAVIAYN ERTGEVRSGQ TNAPQGFEHW LLKLDGVSEV QLGASHGYGR VEMAYYNMAV
ACGIQIMPSR LLEENGRAHF MTKRFDREGG AAKHHIQTFC AMKHFDYNLV TNFSYEQLFQ
TMRELKLSYP DAEQLFRRMV FNVVARNCDD HTKNFAFRLK KDGKWELAPA YDVCHAYQPK
HQWVSQHALS INGKRTNITK DDLLTIGKSI KNKKAAETIE EISNTISQWK TFADEVKVLP
KLRDEIAATL IRL