Gene Cagg_3363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3363 
Symbol 
ID7267103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4076307 
End bp4077446 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content55% 
IMG OID643568172 
ProductMonogalactosyldiacylglycerol synthase 
Protein accessionYP_002464643 
Protein GI219850210 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.879396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000552122 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGCGAG TGCTGATCTT ACACGCCTCA GTAGGAACCG GTCATAAGCG TGCTGCTGAA 
GCGCTGGCTG CCGCCTTTAG CCGTCGCCAA CCTGGTGAAG TGCGTGTCGA GGATGTCCTC
GATCACACCT CCCGTCTCTT TCGCTTCGCA TATGCTCGTT CCTATCTTGA ATTGACCGAC
CGCGCGCCGT TGGTGTGGGG CTATTTCTAC ACACAAACGA ACGCCGACCC CAATCTAGCC
GAGATCACCA ACAATATTCG CAAGTTGGTT GAAAGTATCG GGACGAACGG CCTGAAAGAG
GTCTTACGCG CCTTTCAGCC AGATGTTATC ATTTGTACTC ATTTCTTGCC CATGGAGCTG
CTGGTCAGCT ACAAACGCAG TGCGCGCCTG ACCGAACCGG TCTACTGCGT TATTACCGAT
TACGCCGCCC ATACCTTCTG GACGTACACC GAAATCGATG GCTATTTCGT CGGTGATGAA
CAGACACGCG CACAGCTCAT CGAGCGTGGT GTTAGCCCGC AGCAAGTGGT TGTGAGCGGT
ATTCCAATCG ATCCATGCTT CGCCCAACCG AACGATAGCC GTGAAGCCCG GATACGTCGT
AACTTGCCGC CAGAGGGGAC GGTGGTGACG CTGTTTGGTG GTGGGGTTGA CGATGATCAC
GTGCGGCTGA TCGTCAGCCA ACTTATGCAA AGCCCACTAA AAGCGACGCT GGTGGTGGTC
GCCGGGCGAA ATACGACCCT AGTCGAGTCG TTGAGCGATT TTATTTCGAC CCCGAATATC
GATCTGCGGG TCTTGGGATT TATCGATTAT GTCGATGATC TGATTACGGC GAGCGATTTG
GTGATTACAA AGGCGGGTGG GCTGATCGTG AGCGAGATTC TCGCCCGTGG TACACCGATG
ATTATCATTG ACCCTATTCT CGGCCAAGAG GAGTGGAACG CCGATTACGT CGTCAGTACC
GGCAGCGGGA TCCAATTGCG CATGTGCGAA TCGACCGCAC GGGCCGTGCT CAATCTGCTG
AACCACCCTA CAATGTTGGC TGAAATGCGA CGGTGTGCAA AGGCGGCGTC GCATCCCAAT
GCAGCTCTCG ATATTGCCGA AAAGGTGATC GCCGATCTTG AGAGTTATCG TCACGCCTAA
 
Protein sequence
MPRVLILHAS VGTGHKRAAE ALAAAFSRRQ PGEVRVEDVL DHTSRLFRFA YARSYLELTD 
RAPLVWGYFY TQTNADPNLA EITNNIRKLV ESIGTNGLKE VLRAFQPDVI ICTHFLPMEL
LVSYKRSARL TEPVYCVITD YAAHTFWTYT EIDGYFVGDE QTRAQLIERG VSPQQVVVSG
IPIDPCFAQP NDSREARIRR NLPPEGTVVT LFGGGVDDDH VRLIVSQLMQ SPLKATLVVV
AGRNTTLVES LSDFISTPNI DLRVLGFIDY VDDLITASDL VITKAGGLIV SEILARGTPM
IIIDPILGQE EWNADYVVST GSGIQLRMCE STARAVLNLL NHPTMLAEMR RCAKAASHPN
AALDIAEKVI ADLESYRHA