Gene Cag_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1023 
Symbol 
ID3746751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1377302 
End bp1378519 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content47% 
IMG OID637773552 
Productglycosyltransferase-like protein 
Protein accessionYP_379328 
Protein GI78188990 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCCAGC AACGAAAACC TCGCCTACTT TGGGCGAACC TCTATTGCTT GCTTGATTCG 
TCGAGTGGCG CCTCCATTTC GGTACGAGAA ATGCTACGCC AACTTGCTTA CAATGGCTAT
GAGGTTGAGG TAATTGGTGC AACTATTTTC GATGCTGTTA GCGGCATGAG CGCACTTCCA
CCCCAATGGA AAAAGCGTCT TGAAACCACC GATATTCTTG AACTGAACGA TGCTCCTCTT
CGTCATAAAT TGTTGATGAC CAACAGCCAT CAACGCGATG CCGTAACGGC GCTTGAAGAG
GCTAAATGGT ACGAATTTTA TCTCCACACG CTCAATACGT TTAAACCCGA TGTAGTCTGG
TTTTATGGTG GCAGACCGTT TGACTACCTC ATTTCCGACG AAGCCAAACA TCGTGGTATT
CCTGTTGCCG CTTACCTTGT GAATGGCAAC TACACCAAAA CCCGTTGGTG TAGGGATGTT
GATTGCATTA TTACCGATAC GCAAGCGACG GCTGATTATT ACCATCGAAA AAACGGTTTG
ACGTTGACAC CGGTTGGCAA GTTTATTGAT CCAAAGATGG TGGTGGCTGC GGAGCATCTC
CGACGAAATG TTCTTTTTGT AAATCCAACA TTTGAAAAAG GGGCAGCGCT TGTTGTGCAG
ATTGCTTTGC AGCTTGAGCA GCTACGTCCC GACATTCAGC TTGAAGTGGT TGAGTCGCGA
GGAAGTTGGC GAGGCATGGT TGAGTATGTG AGTGCTCGTT TGGGCAAGCC ACGTACTGGA
TTAAGCAATG TGCAGGTTAT GCCGCACAGC CGCAATATGC GTCCGCTTTA TTCGAGAGCG
CGAATGGTGC TGGCACCAAG CTTGTGGTGG GAAAGTGGTT CACGAGTGCT TGCCGAAGCA
ATGCTGAACG CTATTCCTGC CCTTGTTACC GATAATGGAG GAAACCGTGA AATGGTTGGT
GAGGGGGGTA TTGCCATTGC GCTGCCTGCG AACTATCATG CCAAGCCATA TATCGAGTTG
CTGACTTCCG AATTGCTGGA GCAGTTTGTA GCACAGATTA TTTGCTGTTA TGATGATGAG
CAGTTTTATC AAACGCTGGT TGCTCAAGCA ACGCTTTATG GTTGTACCAC GCATCACATA
AGTACAAGCA CTCAGAAACT TCTCAAAGTA TTTGGGAAGT TAATCGCATC ATCTTCTAAA
GAACTATCCT ATAAATAG
 
Protein sequence
MFQQRKPRLL WANLYCLLDS SSGASISVRE MLRQLAYNGY EVEVIGATIF DAVSGMSALP 
PQWKKRLETT DILELNDAPL RHKLLMTNSH QRDAVTALEE AKWYEFYLHT LNTFKPDVVW
FYGGRPFDYL ISDEAKHRGI PVAAYLVNGN YTKTRWCRDV DCIITDTQAT ADYYHRKNGL
TLTPVGKFID PKMVVAAEHL RRNVLFVNPT FEKGAALVVQ IALQLEQLRP DIQLEVVESR
GSWRGMVEYV SARLGKPRTG LSNVQVMPHS RNMRPLYSRA RMVLAPSLWW ESGSRVLAEA
MLNAIPALVT DNGGNREMVG EGGIAIALPA NYHAKPYIEL LTSELLEQFV AQIICCYDDE
QFYQTLVAQA TLYGCTTHHI STSTQKLLKV FGKLIASSSK ELSYK