Gene Cag_1863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1863 
Symbol 
ID3747015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2371911 
End bp2373047 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content49% 
IMG OID637774400 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_380156 
Protein GI78189818 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAT ATAAGCTGCT TTGGTTTTCT GAAATACAGT GGGATTTTCT TTCAACCCGC 
AAACAACGCT TGTTGGCACG CTTTCCTGAC GAGTGGCATA TCTTATTTAT TGAACCCTTT
ACGCTTGGGC GAAAACATCA TTGGTTGCCC GTAAAGCGAG GGCGCGTGTG GGTAGTTACC
GTGCCGTTCC TTAAAACTAT TCCGTTTCGC TTTGGTGCTT TACTGAAGCG CCCCTTAGTG
CGCACGCTTG CGGGATTGCC GGGCATTGCC ATCATGCACC TTTGGACGCT GTTGCTGGGC
TTCAGTTCAT CACAACGTAT TATTGCGCTC AGCAATCCCT ATTGGGGGAA GGTTGCCTCA
CACCTCCCCT GCCGATTCCG CTGTTACGAT GCCAACGATG ACCATCTTGC CTTTCCCTCC
ACTCCCTCTT GGTTACCTGA TTGGCTTCAA CGCTACCTTT CAACAACATC GTTGGTTTTT
AGTGTCAGCA AAGAACTGAC GGCTCGGCTT CCACTCTCTT CTTCCACAAA AGTTGTTGAG
TTAGGTAATG GTGTTGAGTT CAACCACTTT GCAACTCCTC GCCAAAACAA ACCATCACAA
CTTGCAGCGC TTTCAGGAAA AATTCTTGGC TATGCGGGAG CAATGGATTG GCTTGATGTT
GATTTGCTTG AAAAAGTAGC TCAAACCTAT CACCAATATC ATCTTGTACT GCTTGGTCCT
GCTTACGAGC ATGGATGGAT GGAACGGCAG TTAGGGTTGC AAGCGCTGCC CAACGTGCAC
TATTTCGGCA AAATTGAGTA CAGCGAATTA CCTGCATGGG TGCAAGCTTT TAGCGTTGCG
CTTATGCCGC TTGTTGCCAA TCCACTGAAA CAAGTGTCGC ATCCCAACAA GCTTTACGAA
TATCTTGCAA CGGGCGTGCC TGTGGTTGCT ATGAACTATT GCAGTGCAGT GGAAGCAGCG
GCTGACGTGG TGCATGTTGC TCAGTCGTAT GAAGAGTTTG TGCAGCTTGT GCCCATTGCG
TTGGCTGATA ATCGTCGTGA AGCACGGCAG GCATTTGCAA AGCAGCATAG CTGGGATGCA
CTTGCGGCTA CGATGGTTCA CGAGTTACAA CATGCTTGGC AGGAGAGTGC GCCATGA
 
Protein sequence
MKPYKLLWFS EIQWDFLSTR KQRLLARFPD EWHILFIEPF TLGRKHHWLP VKRGRVWVVT 
VPFLKTIPFR FGALLKRPLV RTLAGLPGIA IMHLWTLLLG FSSSQRIIAL SNPYWGKVAS
HLPCRFRCYD ANDDHLAFPS TPSWLPDWLQ RYLSTTSLVF SVSKELTARL PLSSSTKVVE
LGNGVEFNHF ATPRQNKPSQ LAALSGKILG YAGAMDWLDV DLLEKVAQTY HQYHLVLLGP
AYEHGWMERQ LGLQALPNVH YFGKIEYSEL PAWVQAFSVA LMPLVANPLK QVSHPNKLYE
YLATGVPVVA MNYCSAVEAA ADVVHVAQSY EEFVQLVPIA LADNRREARQ AFAKQHSWDA
LAATMVHELQ HAWQESAP