Gene Cag_1546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1546 
Symbol 
ID3746546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2026285 
End bp2027334 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content41% 
IMG OID637774086 
Productglucose-1-phosphate thymidylyltransferase 
Protein accessionYP_379844 
Protein GI78189506 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1209] dTDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000479594 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAAAG AACTGATATA TAGTGCGTTT TCAAATAAAA TTGGACATCG AAAACGAGAT 
TGTAACAAAG CAATGAAAGC AATTATTCCT GTTGCGGGTG TGGGCACTCG TTTGCGCCCA
CATACTTTTT CTCACCCTAA AGTACTACTG AACGTCGCAG GCAAGCCCAT TATTGGCCAT
ATTATGGATA AGCTGATTGC TGCTGGCATT ACAGAGGCAA TTGTTATTGT TGGCTACCTT
GGTGATATGA TTGAGGAGTG GCTCTTGCAA AATTACGACA TCAAATTCAC CTTTGTAACA
CAATCGGAGC TATTAGGGTT GGGGCACGCC ATTTCAATGT GCAAGCCTTA CATTCCTGAA
GATGAGCCGC TCTTTATCAT TTTGGGAGAT ACTATTTTTG ATGTTAACCT TGAGCCTGTT
TTAAAAAGCA CCTGTTCAAC AATTGGCGTT AAAGAGGTGG TTGATCCTCG CCGTTTTGGT
GTAGCCGTTA CTGAAAATGG TGCCATTGTA AAGCTTGTTG AAAAACCCGA CACTCCAGTA
AGCAACCTTG CTATTGTTGG GCTCTACCTT TTGCAACATT CAGCCGCACT CTTTAAAAGC
ATTGATTACT TAATTGAGCA CAACATTACC ACAAAAGGTG AATATCAATT AACCGATGCT
TTGCAGCGCT TGCTTGACGA AGGCGAAAAG TTTACCACCT TCCCTGTACA AGGGTGGTAC
GATTGTGGTA AACCCGAAAC GCTGCTTGCC ACCAACGAAA TCTTACTGTC CGATAATCCC
CCATCTAAAA CATACCCTGG TTGCATTATT AACGATCCTG TGTTTATTGC AGAAAGCGCT
AAACTTGAAA ATGCCATTAT TGGACCTTAC ACCACTATTG GTGAAGATGT GGTTATTAAG
GATGCCATTA TTAAAAAGTC CATTATTGGC AACAAAGCCC AAGTAAAGCA CATTATGCTG
GGCAACTCCA TTATTGGCAA TAACGCCATT ATTCGTGGCA CTCCGCATGA AATTAATATT
GGCGATTTCT CTGAAATTCG TGTAAGCTAA
 
Protein sequence
MLKELIYSAF SNKIGHRKRD CNKAMKAIIP VAGVGTRLRP HTFSHPKVLL NVAGKPIIGH 
IMDKLIAAGI TEAIVIVGYL GDMIEEWLLQ NYDIKFTFVT QSELLGLGHA ISMCKPYIPE
DEPLFIILGD TIFDVNLEPV LKSTCSTIGV KEVVDPRRFG VAVTENGAIV KLVEKPDTPV
SNLAIVGLYL LQHSAALFKS IDYLIEHNIT TKGEYQLTDA LQRLLDEGEK FTTFPVQGWY
DCGKPETLLA TNEILLSDNP PSKTYPGCII NDPVFIAESA KLENAIIGPY TTIGEDVVIK
DAIIKKSIIG NKAQVKHIML GNSIIGNNAI IRGTPHEINI GDFSEIRVS