Gene Cag_0578 details

Gene Information

Locus tag: Cag_0578
Symbol:
ID: 3747525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 682281
End bp: 683330
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 42%
IMG OID: 637773112
Product: hypothetical protein
Protein accession: YP_378894
Protein GI: 78188556
COG category: [S] Function unknown
COG ID: [COG4804] Uncharacterized conserved protein
TIGRFAM ID:
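
The coordinate and length fields above are mutually consistent. A minimal Python sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 682281
END_BP = 683330


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1050
print(protein_length(length_bp))  # 349
```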


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.518293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAG AACTATCGCA CCGTGCCGAT TATCAAAAGC TGCTTGGCAG TATTTCAACG 
CTTTACACGT CGGGGCAGCG CCGTGCCTAT CAAGCCGTTA ACTCCGTTAT TACCGAAACC
TATTGGCAGA TTGGGTGTTA CATGGTGGAG TTTGAACAAT GCGGAAACAT TCGAGCTGAA
TATGGTAAAG CGTTACTTGA CAATTTATCT CGTGATTTAA CGTTGCGCCA TGGCAAAGGA
TTTAGTCGCA GCAACATTAT CCGTTTTAGG CAATTTTATC TTGCTTATCC AAAAGGTGCG
AAGCCTTCGC ACCTTTTGAG CTGGTCGCAT TGGGTTGAAT TATTGAAGCT GGATGATCCT
CTCGAACGCA GTTTTTATGA ACAACAAGCC ATTCGAGAAA AATGGTCAGT TCCTGAACTT
CAACGTCAAA AAAAGTCGTC GCTGTTTCTT CGTTTAGCGG CTGGAAAGGA TAAAGCAGCA
ATTTTGCAAC TTGCAGAGCA AGGGCAAATA GTAGAGCAAC CGGCTGATTT ATTACGCGAT
TCGTTTGTTT TTGAATTCTT AAAAATCCCT GAATCCAGCG AGATGGCTGA GCTTGATCTT
GAAAGTCGCT TGTGCGATCA TCTACAACCC TTTTTGTTGG AGCTTGGCAA AGGCTTTACC
TTTGTTGGGC GGCAATACCG CATTCCTATC AACAATAGCA ATTATCGGGT TGACTTGGTT
TTCTATCACC GTATTTTGCG TTGCTTTGTT TTGATTGATT TAAAAATCAA TGAGGTTGAG
CATCACGATA TTGGGCAAAT GAACCTCTAT CTTGGTTATT TTGCGGCTGA AGAAAATACA
CCCGACGACA ACCCACCAAT AGGCATTATT CTTACTCGTC AAAAAGATGA ATTGCTCGTT
GAGTATGCAA CTTATCAAAT GAACAGTCAG CTTTTTGTTC AAAAGTATCA GCTCTATTTG
CCGGATCGCG AAGAGTTGCG GCGAGAAATT GAGCGCGCGT TGTGGGACAT TGAAGAGAGC
AACAGTAATA AGGAAAAAAA GAACGAATGA
 
Protein sequence
MSKELSHRAD YQKLLGSIST LYTSGQRRAY QAVNSVITET YWQIGCYMVE FEQCGNIRAE 
YGKALLDNLS RDLTLRHGKG FSRSNIIRFR QFYLAYPKGA KPSHLLSWSH WVELLKLDDP
LERSFYEQQA IREKWSVPEL QRQKKSSLFL RLAAGKDKAA ILQLAEQGQI VEQPADLLRD
SFVFEFLKIP ESSEMAELDL ESRLCDHLQP FLLELGKGFT FVGRQYRIPI NNSNYRVDLV
FYHRILRCFV LIDLKINEVE HHDIGQMNLY LGYFAAEENT PDDNPPIGII LTRQKDELLV
EYATYQMNSQ LFVQKYQLYL PDREELRREI ERALWDIEES NSNKEKKNE
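
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is one way to reproduce that translation in Python; it is illustrative and not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). `CODON_TABLE`, `clean`, and `translate` are hypothetical names.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped on this page.
print(translate("ATGAGTAAAG AACTATCGCA CCGTGCCGAT"))  # MSKELSHRAD
```

Passing the full gene sequence to `translate` gives a string that can be compared against the protein sequence after both are run through `clean`.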