Gene Cmaq_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1074 
Symbol 
ID5710373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1125054 
End bp1126127 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content47% 
IMG OID641275574 
Productpyridoxal-5'-phosphate-dependent protein beta subunit 
Protein accessionYP_001540893 
Protein GI159041641 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0511297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.236924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCTTA ACTTAACTAA CATGAGCGGG GAGGTAACGT ACAAGTGCCC TAAATGCGGC 
TTCACAACCG AGGCTAATAC CTGGCTTATC AAGTGCCCTA GATGCGGGTC ACCGTTGAAT
GTGAATTATG ATTTAAGGAG ACCTAGGGAG CTTAGTAGAA GTGAATTAAC GAGGATTCTA
CCTGTTAAGG AGCCGTTAAG CCTTGGTGAA GGTTTAACCC CACTGGTTAG GAGGGGTGAT
TACTACTTTA AGCTTGAGTA CCTTAACCCC ACTGGTTCAT TTAAGGATAG GGGGTGGAGT
CTTGCCCTAT CAGTACTACG TAATGACGTC ACTGTGGTTG AGGATTCAAG CGGTAATGCT
GGACTATCCC TAGCAGCATA CTCCGCGGTT AAGGGGGTTA GGGCTAGGAT TTACGTCCCT
AAGACGGCCC CTGAGGCTAA GAAGAGACTT ATGAGGCTCC TAGGCGCTAA TGTGGTTGAG
GCTGCAACTA GGGCTGATGC ATCATCACTT GCCATGAGCT TCACAGAGGG GGTTTACGTT
GGTCATTCAT GGAACCCCTT CTTCATACAT GGCGTTAAGT TAATAGCTTA TGAACTAGCA
TTAGAATTAG GGAACATTGA TAATGTTGTT GCACCCCTAG GTAACGGTAC CTTAACCCTA
GGCCTATACT TAGGTTTCAA GGAGGCTGAG GAATTGAAGC TTATTAAAGA CACACCTAGG
ATAATAGCCG TTGAGGCATC AGGCTACGAG TGGGCTTACA GTATGCTTCA CAACACACCC
ATGGGTGTTA AGGCAACCTT ACCCGACGGC ATAATAGTGC CCCAGCCACC TAGGTTAACT
CAGATAATTG ACGCGATACG GGACACTGGA GGTGACGTGG TGGTTGTTAA TGATCAGGGG
GTTATTGAGG GGTTAAGGGA GGGTATTAGG TTAGGGTTCA TAATTGAGCC AACAAGCGCA
GTTGTCTTTA AGGCCCTTAA GGAAGTGAAC CTAAGTGGCA CAACTGTAGT TATTTTAACG
GGTTCAGGCT TAAAGCTGAG TAATGAACTG TATCGGTTAA TATACGGTGA ATGA
 
Protein sequence
MVLNLTNMSG EVTYKCPKCG FTTEANTWLI KCPRCGSPLN VNYDLRRPRE LSRSELTRIL 
PVKEPLSLGE GLTPLVRRGD YYFKLEYLNP TGSFKDRGWS LALSVLRNDV TVVEDSSGNA
GLSLAAYSAV KGVRARIYVP KTAPEAKKRL MRLLGANVVE AATRADASSL AMSFTEGVYV
GHSWNPFFIH GVKLIAYELA LELGNIDNVV APLGNGTLTL GLYLGFKEAE ELKLIKDTPR
IIAVEASGYE WAYSMLHNTP MGVKATLPDG IIVPQPPRLT QIIDAIRDTG GDVVVVNDQG
VIEGLREGIR LGFIIEPTSA VVFKALKEVN LSGTTVVILT GSGLKLSNEL YRLIYGE