Gene Cmaq_1475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1475 
Symbol 
ID5709533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1552283 
End bp1553344 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content49% 
IMG OID641275984 
Productglucose-1-phosphate thymidyltransferase 
Protein accessionYP_001541289 
Protein GI159042037 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1209] dTDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000811226 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.134994 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGGAC TTATTTTAGT TGCTGGTATT GGTGAACGCA TGAGGCCATT ATCATACTCA 
ATACCTAAAC CCCTCATCTC CATCCTCGGT AAACCCCTTG TTGCCTACAC CATGGATAAG
TTAAAGGATA TTGATGTTAG TAGGATTGGG CTTGTTGTGG GTAGGTTTAG TGAATTATTC
ATGGACTACT TCAACAATGA CCCAAGACTC AACATCCCAG TAACCTACAT ACGCCAGGAG
AGGCGCCTCG GCATTGCCCA CGCCATCTAC AGGGGTATTG AGGAGGGTTT CCTAAGGGAG
GACTTCGTGG TTGCTCTGGG TGACAACTAC TTCTCGGAAT CATTCACGCG ATTCGCCAGG
GAGTTTCTTG AGGGTGGTTA CGACGTCTTC ATAGTCCTCA CTAGGCATCA GCAGTTTCAA
CGCTTTGGTA ATGCCGTGGT GGAGGGTGGT AGGGTGGTTA GGCTTATTGA GAAGCCTAAT
CAACCCATAC CTAACTCCTA CGTGGTCACT GGACTCTACT TCTTCCGTGA CCCTGATGCA
GTGGCTAAGG CGTTCTCCAA CCTGAGGCCC TCGGCCCGTG GTGAGTATGA GGTGACTGAT
TTAATACAAT GGTTCATAGA TAATAATTAC AGGGTTGGTT ACTCATTAAC CACTGGTTGG
TGGAAGGACA TGGGTACACC GGAGGATTTA ATAGATTTAG TGCAGTTGAT GCTTGATGAC
GCTAAACCGC GGATTGATGG TGACGTGAGG GGTAGGGTAA GTAGTAGGGT TATTGTGGAG
AAGGGTGCTG TCGTGGAAGG CGCAGTCCAT GGACCAGCCT ACGTGGGTAG GGGTGTCTAC
GTGGGTAAGG ATGCTGAGAT TGAGCACTTC GTGAGCCTTG AGGAGGGTGT ACACATGGAG
AGCGGCAGCA TATCAAGGAG CCTAATCCTC GAGGGCGTTA CACTACACCT GGGTAAGGCT
AGGTTAACGG ACTCGGTAAT AGGCCCTAGA TCATATGTAA TACTTAAAAA CGGTAGACAC
AGGATGATTA TTGGTGAAAA TGGTAGGGTT GAGGAGGTGT GA
 
Protein sequence
MLGLILVAGI GERMRPLSYS IPKPLISILG KPLVAYTMDK LKDIDVSRIG LVVGRFSELF 
MDYFNNDPRL NIPVTYIRQE RRLGIAHAIY RGIEEGFLRE DFVVALGDNY FSESFTRFAR
EFLEGGYDVF IVLTRHQQFQ RFGNAVVEGG RVVRLIEKPN QPIPNSYVVT GLYFFRDPDA
VAKAFSNLRP SARGEYEVTD LIQWFIDNNY RVGYSLTTGW WKDMGTPEDL IDLVQLMLDD
AKPRIDGDVR GRVSSRVIVE KGAVVEGAVH GPAYVGRGVY VGKDAEIEHF VSLEEGVHME
SGSISRSLIL EGVTLHLGKA RLTDSVIGPR SYVILKNGRH RMIIGENGRV EEV