Gene Cagg_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1083 
Symbol 
ID7268535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1340627 
End bp1341616 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content54% 
IMG OID643565928 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_002462433 
Protein GI219848000 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000927283 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGGAAT TACGGCTGAA TATTGCTACC CGTGAATGGG TAATCATTGC CAGCGAACGT 
GCCCGCCGTC CCAATGCATT CACCGAGACG CGACACCAAC CACGAACTGC CGAGCGTCCA
CTGCACGATC CGCACTGTCC CTTCTGTGTT GGTAACGAAG AGCTTGACCT TGAAGTCGAA
CGATACCCGG CGACCGGGCC GTGGCAATTA CGCATTGTTC GCAATAAGTA TCCGGCATTG
CACGATCAGG GGCCGGTGAT GCGTCGTTTT GATGGTCTGC GACGCACTCT GAGCGGCTAT
GGTTACCACG AGGTGCTGGT CGAGCATCCC CATCACAATA CAACGTTGGG GTTAATGACC
AATGCCGAGG TAAAGGCTGT GCTGGAAATG TATCTGCGGC GTGGTCGGGC AATGAGTGCC
GATCCGCGGG TAGAGCAGGT GGTTATTTTT AAGAATCACG GTGAACGGGC CGGTGCCTCG
TTACAGCATC CGCATAGTCA ACTGATAGCT GTGCCGGTAG TCCCGGCTGA TGTTCGGCAT
CGGATTGAGG AGGCGCGTCG GTTTTTTGAT GATACCGGCC AGTGTGTCTT TTGTGCAATG
CTGGCCGATG AGCTGGCCAG TAACGAACGA TTGGTGTATG CAAACGATGA TTTTGTCGCG
TTTGTGCTCT ACGCAGCCTC TTCCCCATTC CACATCTGGA TCTTGCCGCG TAGACATCGG
GCTAGTTTTT TTCATATCGA TGAGACGGAA CTTGACGGTC TGGCCGATGT AGTGCGGGAA
GTGTTTTATC GCCTCTACTA TCGCCTCAAC GATCCCGATT TTAATCTGGT GCTCCGCTCG
ACGCCGGCCA AAGAGCCGGA GAATGGCTAT TTTCACTGGT ACCTGGCCGT TGTCCCACGG
CTGTCGTATA TGGCCGGCTT TGAGATGGGG AGCGGTATTT TTATCAATCC CAGTATTCCC
GAAGCCTGCG CCGCTTTTCT GCGTGAATAA
 
Protein sequence
MSELRLNIAT REWVIIASER ARRPNAFTET RHQPRTAERP LHDPHCPFCV GNEELDLEVE 
RYPATGPWQL RIVRNKYPAL HDQGPVMRRF DGLRRTLSGY GYHEVLVEHP HHNTTLGLMT
NAEVKAVLEM YLRRGRAMSA DPRVEQVVIF KNHGERAGAS LQHPHSQLIA VPVVPADVRH
RIEEARRFFD DTGQCVFCAM LADELASNER LVYANDDFVA FVLYAASSPF HIWILPRRHR
ASFFHIDETE LDGLADVVRE VFYRLYYRLN DPDFNLVLRS TPAKEPENGY FHWYLAVVPR
LSYMAGFEMG SGIFINPSIP EACAAFLRE