Gene Cag_0169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0169 
Symbol 
ID3747734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp189581 
End bp190696 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content49% 
IMG OID637772696 
Producttetraacyldisaccharide-1-P 4'-kinase 
Protein accessionYP_378490 
Protein GI78188152 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1663] Tetraacyldisaccharide-1-P 4'-kinase 
TIGRFAM ID[TIGR00682] tetraacyldisaccharide 4'-kinase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATC CCCTGCGCCT TGCTTTTCGT CCTTTTGCCT TGCTCTACGA AGCAATTGTG 
CAAACTCGTA ACCAGCTTTT TAATCGTGCG GTATTAAGGG CATGGGAATC GCCAATGCCC
GTTGTTTCTG TGGGGAATTT AAGTGCAGGA GGTACAGGAA AAACACCAAT GGTGGATTGG
GTGGTGAAAT ATTATCTCTC AATAGGTTTT AAACCCGCAA TTATTTCGCG CGGCTATAAG
CGCCAATCAA AAGGGGTGCA GCTTGTGTCG GATGGCAATA ATGTGCTACT CAGTAGCCGT
GAAGCGGGCG ATGAAACCGC TATGTTGGCA TGGAATAACC CCGATGCAAT TGTGGTTGTA
GCAAGTAAGC GCAAGCAAGG GGTCAAGCTT ATTACCAAAC GCTTTGCCCA ACGCCTCCCA
TCGGTTATTA TTTTAGATGA TGCTTTTCAG CACCGCCAAA TAGCGCGTTC GTTGGATATT
GTGTTGGTGA ATGCCGAAGA GCCATTTGTG GAAGCTGCCA TGCTCCCCGA AGGGCGCTTG
CGGGAGCCAA AAAAGAATTT GTTGCGAGCA GATGTCGTGG TATTGAACAA AATTACCGAC
CTTGAAGCCG CAACACCATC CATTAAAGCG CTTGAGGAGA TGGGGCGACC ACTTGTTAAA
GCACGCCTGA GCACCGGTGA ATTAATTTGC TTTTCGGGCG ATGCCACCAC GCTTGACGAG
CCAGCCACCG CTCACCACCT GAACGCGTTT GCTTTTGCTG GAATTGCAAA ACCTGAAAGC
TTTGTAACAA GTTTGCAGCA CGAAGGGGTA AATGTGGGAG CAACCCGCTT TGTGCGCGAC
CATGCACCGT ACAGTGCCAA AATGTTACGA GCTATTCGCC GCCAAGCTGA GGAGCAAGGG
TTGTGCTTAA TTACCACCGA AAAAGATTAC TTCCGCCTGC TTGGGCAACC CGAACTCCTC
AGCATTATTA CCGCTCTCCC CTGCTACTAC CTTAAAATAG CCCCCGATAT TTTTGACGGC
AAAGCGCTTT TGCAAGAGAA GCTAAATGCG GTTGTTCATT ATGTACCAAA ACCGGAGCCG
CCAAAGAAAA TTGAGGAACC ATATCGGCGA TGGTAA
 
Protein sequence
MSNPLRLAFR PFALLYEAIV QTRNQLFNRA VLRAWESPMP VVSVGNLSAG GTGKTPMVDW 
VVKYYLSIGF KPAIISRGYK RQSKGVQLVS DGNNVLLSSR EAGDETAMLA WNNPDAIVVV
ASKRKQGVKL ITKRFAQRLP SVIILDDAFQ HRQIARSLDI VLVNAEEPFV EAAMLPEGRL
REPKKNLLRA DVVVLNKITD LEAATPSIKA LEEMGRPLVK ARLSTGELIC FSGDATTLDE
PATAHHLNAF AFAGIAKPES FVTSLQHEGV NVGATRFVRD HAPYSAKMLR AIRRQAEEQG
LCLITTEKDY FRLLGQPELL SIITALPCYY LKIAPDIFDG KALLQEKLNA VVHYVPKPEP
PKKIEEPYRR W