Gene Cmaq_1144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1144 
Symbol 
ID5710144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1200083 
End bp1201249 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content42% 
IMG OID641275643 
Productpyridoxal phosphate-dependent enzyme, putative 
Protein accessionYP_001540961 
Protein GI159041709 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1921] Selenocysteine synthase [seryl-tRNASer selenium transferase] 
TIGRFAM ID[TIGR01437] uncharacterized pyridoxal phosphate-dependent enzyme 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.46182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.521262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTTT TGGATAAGTT AGGTGTACGT AAGGTTATTA ATGCCTGTGG TACACTCACA 
GTACTTGGTA GTAATAGGGT TAGTTCAAGG GTTTTAGAGG CAATGAGGGA GGTTGCGGAT
TCCTTCATTG ATATGAATGA ACTCCTAGTT AAGTCAGGGG AATACATTGC CAAGTTACTA
AATGTACCCG GTGCATTAGT AACAAGCGGT GCCGGAGCCG GCTTAGTACT GGCTGTTGCA
GCAGCTATTA CTGAGGGTGA TGTGGATAAG ATGAGTAGGT TACCCTTCAC TGATGGGTTA
AGGAATGAGA TTATTATCCA ATATCCACAC ACAGTGGGTA ATCCATACGT TTACCTCATT
AATATTCCAG GGGGTAGAGT AAGGATTGTG GGTTCACCAA GTGGTGTTAA TGAAAACGAT
ATTAAGAATG CCTTAAATAA AAACACAGCC GCAGTACTTC ACTTCCAGTA TGAGCCACAG
GAGGGTGAGG TGCCTTTAAG TAAGGTTATT GATATTGCCC ATGAATTTAA CACGCCAGTT
ATAGTTGATG CCGCTGCCGA ACTGCCACCA TTACTTAACT TAACAAGGTT CATTAAAATG
GGGGCTGACT TAGTAGTGTT CAGTGGCGGT AAGGATATTG GTGCACCCGG TGATACAGGC
TTGATTCTGG CTAATAATTT AAGGCTCCTT GAGGCGTGTA GGTTAATGAG CCCATTCAGT
TACATTAATG TTAATGGGCA ATCCAGGGTA TTCATAGGTA GGGTAATGAA GATTAGTAAG
GAGGATATTG TAGCCCTAGT CGCGGCACTG GAGGAGTACG TTAAGGTTAA TCATGAGGAG
AGGTTAAGTG TAATGAATAA GATGGCTGAT GAAGTAATAA GTGAATTAAC CGCAGTATTA
CCGGGTATTA GGATTGAGAA AAGGCTGAAT CATCCTGGGG AGAGGATAAG GCCGGTAACA
GTACCTAAGG TTGAGATTAA GTTACCGAGA AGGTACACGG AATTATACAT TAAGTTACTA
AGGGAGGGGG ATCCACCAAT ATACGCATGT GAATGTGAAG GTAATTTATG CATTAACATG
CATACGTTAA GCCAGGATGA GGTTCCCATT GTTATTAACA GGTTAAAGGA GGTGATTAGT
AGGTATCCGC CAGTAACTAA TCAATGA
 
Protein sequence
MGVLDKLGVR KVINACGTLT VLGSNRVSSR VLEAMREVAD SFIDMNELLV KSGEYIAKLL 
NVPGALVTSG AGAGLVLAVA AAITEGDVDK MSRLPFTDGL RNEIIIQYPH TVGNPYVYLI
NIPGGRVRIV GSPSGVNEND IKNALNKNTA AVLHFQYEPQ EGEVPLSKVI DIAHEFNTPV
IVDAAAELPP LLNLTRFIKM GADLVVFSGG KDIGAPGDTG LILANNLRLL EACRLMSPFS
YINVNGQSRV FIGRVMKISK EDIVALVAAL EEYVKVNHEE RLSVMNKMAD EVISELTAVL
PGIRIEKRLN HPGERIRPVT VPKVEIKLPR RYTELYIKLL REGDPPIYAC ECEGNLCINM
HTLSQDEVPI VINRLKEVIS RYPPVTNQ