Gene Cagg_3037 details

Gene Information

Locus tag: Cagg_3037
Symbol:
ID: 7266568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3692343
End bp: 3693389
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 59%
IMG OID: 643567858
Product: PfkB domain protein
Protein accession: YP_002464332
Protein GI: 219849899
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2870] ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase
TIGRFAM ID: [TIGR02198] rfaE bifunctional protein, domain I
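
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(3692343, 3693389)
    print(length, protein_length(length))  # prints "1047 348"
```

Under these assumptions the values match the reported 1047 bp gene length and 348 aa protein length.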


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000272424
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTGCCC CGCTGCCATA CCCTATCACT CGTATGCCAA CCGCCTTTCC GCATCACACG 
CAGTTGCCCG ATCCAACGGC TCTCGCCGGT CGGCGGGTTG TTGTGGTCGG TGACATTACC
CTTGACGAGT ACCTTTACGG TCGGCCTACC CGCCTCTCGC GCGAAGCACC GATCCCGGTG
TTAGAGTACC TACGGCGTGA AACTATTCTC GGCGGGGCAG CCAATCCGGC TCGTAATATC
GTCGCTCTTG GCTCATACGC CAGCCTCGTC GCCGTTGTCG GTGATGATGA AGAGGGCGAT
CATCTTCGCA CCCTCCTGCA CACGGCAGCG ATTGACGACA GTGGCGTCAT CACGATTACC
GGACGAATGA CAACCCGCAA AACTCGGATT CTGGCCGATG CCAGCCCACG GCTTCCCCAA
CAGGTTGCAC GCCTCGACCG ACTCGACCGC AGCCCTCTCG CATCGGTTAC TGAAGAGCGG
GTGATCGCAG CACTTGCCGA GCAGATTCCG CATGCTGATG CAGTAATCTG TTCCGATTAT
CAACTAGGGT TACTCACACC GCGGGTTGTT GATGCTGTGC GTGATTTATG TCGACGGCAT
GGGGCAATCT TTGCGGTTGA TGCCCAGGGG AATGCCCATT ACTACCACCA CGCCAGCCTC
TTTCGCTGCA ACGACGCCGA GGCCGCCGCA ACGCTAGGCA TGTCTACAAT TGACGATGAA
ACCATCACCG GTGCGATTAG CCGGCTGTAC CACGAACTCG CCGCGCGTCT CGTGATCGTT
ACCCGTGGCC CTGCCGGATT AGCGCTAATC GGTGATGACG AGCCATTCCT CCAACTCCCG
GCTTACCGTG TCAGCGAAGT CTTTGATACG ACCGGTGCCG GCGATACGTT CATTGCCGTA
GCAACGCTGG CACTGGCTGC CGGTTATCGT GGCAAGATTG CAGCAGCATT AGCCAACATC
GCCGCAGCAC TTGTCGTCAG GCGGTTGGGC AATGCGGTTG TAACGCCAAC CGAATTGGCC
GCGGCAATTA CCACTGCTGA AGTGTGA
 
Protein sequence
MTAPLPYPIT RMPTAFPHHT QLPDPTALAG RRVVVVGDIT LDEYLYGRPT RLSREAPIPV 
LEYLRRETIL GGAANPARNI VALGSYASLV AVVGDDEEGD HLRTLLHTAA IDDSGVITIT
GRMTTRKTRI LADASPRLPQ QVARLDRLDR SPLASVTEER VIAALAEQIP HADAVICSDY
QLGLLTPRVV DAVRDLCRRH GAIFAVDAQG NAHYYHHASL FRCNDAEAAA TLGMSTIDDE
TITGAISRLY HELAARLVIV TRGPAGLALI GDDEPFLQLP AYRVSEVFDT TGAGDTFIAV
ATLALAAGYR GKIAAALANI AAALVVRRLG NAVVTPTELA AAITTAEV
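
The gene and protein sequences are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the two sequences could be related: it strips the whitespace, computes GC content, and translates the nucleotide sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons); the helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line fragment of the gene sequence, as printed on this page.
    fragment = clean_sequence("ATGACTGCCC CGCTGCCATA CCCTATCACT")
    print(translate(fragment))  # prints "MTAPLPYPIT"
```

Applying `clean_sequence` to the full gene sequence and passing the result to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content reported on this page.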