Gene Cmaq_1972 details


Gene Information

Locus tag: Cmaq_1972
Symbol:
ID: 5708446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 2048560
End bp: 2050005
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 47%
IMG OID: 641276482
Product: prolyl-tRNA synthetase
Protein accession: YP_001541778
Protein GI: 159042526
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00408] prolyl-tRNA synthetase, family I
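
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that does not appear in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2048560
end_bp = 2050005

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced reading frame whose last codon is a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the record: 1446 bp and 481 aa.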


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTAGGG GGCCTCAGGG GAGGCCTAGG AGTAGGTGGG TTTCATTCAT TGAGTGGTTT 
AATAAGGTAA TAATGGATGC TGAAGTGTAT GACTACAGGT ACCCTGTTAA GGGGGCCTAC
ATATGGAGGC CTTACGGTGT CGCTATTAGG CGTAATGTGG AGGCGTTAAT ACGTCGCCTA
CATGATGAGA CAGGGCACCA GGAGGTATTA TTCCCCGTGT TCATCCCCTA TGAGTTCTTC
TCAAAGGAGA GTGAGCATAT TAGGGGTTTT GAGAGTGAAG TCTTCTGGGT TAGTAAGGGT
ACTGGTGGTG AGGAGCGCCT AGTGTTGAGG CCGACCAGTG AGACTGCTAT GATGCCTATG
TTTAAGCTTT GGATAAGGGA TCACACGGAT TTACCATTGA GGGTGTATCA AATAGTCAGT
GTGTTTCGTG CTGAAACAAA GATGACTCAC CCAATGATTA GGCTTAGGGA GATTTCAATG
TTTAAGGAGG CTCACACTGC CCATGCGGAT AGGGATGATG CTGAGAGGCA GGTTAAGGAG
GCTGTAGGCA TATATAGGAG GATTATGGAT GAATTATGCA TACCGTACTT AATAAGTAGG
AGACCGGATT GGGATAAATT CGCAGGTGCA GTATACACCA TAGCCTTCGA TACAATAATG
CCCGACGGTA GGACAATGCA AATAGGCACT GTTCACTACC TGGGTGAGAA CTTCTCAAGG
GTTTTTGACG TGAAGTACCT GGGTAAGGAT GGGCAAATGC ATTACATTCA CACCACCAGT
TACGGAATAT CTGAGAGAAT CATAGCGTCA ATGATCGCTG TTAACGGTGA TGATAGGGGT
TTACTCCTAC CCCCAAGGTA CGCTCCAATT CAAGTAGTGG TAATACCGAT AATGTATGGT
GAGGATCAAA GCGTATTGAA TTACGCTAAG GGTGTAAGCG GTGAATTACT TAATGCCGGT
GTGAGGGTTC ATGTTGATGA TAGGAGGGAT AAGACACCTG GCTGGAAGTA CTACCACTGG
GAGCTTAAGG GTGTTCCAAT TAGGCTGGAG GTGGGGCCGA GTGATGTTAA GGATAATGCA
GTAACATTAA CCAGGAGGGA TACATTCGAG AAGTATGCCG TGGAGAGGAG TAATGTCGTT
GATGCCGTGA GGGAATTAAT GAAGGCTATA GAGGATAATA TGCGTAAGTC AACGTGGGAG
TGGTTAAGGA GCCACGTTAG GAGAAGCAGC AATGTAAGTG AAGCTAAGGC ATTGCTCAAT
GAGGGTGGTG TGGTTGAGGT TCCGTGGAGT GGCGATGATG AATGCGGTAG GAGAATAATG
GAGCTCACTG AATCCGATGC ATTAGGCATA CCACTGGATA CTGATGAAAC CCCAAGTGAC
CTTCGTGACG CAGCCTGCAG TGAGAAGAAG GCTGAGTACT GGCTTAGATT ATCAAGGAGG
TATTAA
 
Protein sequence
MVRGPQGRPR SRWVSFIEWF NKVIMDAEVY DYRYPVKGAY IWRPYGVAIR RNVEALIRRL 
HDETGHQEVL FPVFIPYEFF SKESEHIRGF ESEVFWVSKG TGGEERLVLR PTSETAMMPM
FKLWIRDHTD LPLRVYQIVS VFRAETKMTH PMIRLREISM FKEAHTAHAD RDDAERQVKE
AVGIYRRIMD ELCIPYLISR RPDWDKFAGA VYTIAFDTIM PDGRTMQIGT VHYLGENFSR
VFDVKYLGKD GQMHYIHTTS YGISERIIAS MIAVNGDDRG LLLPPRYAPI QVVVIPIMYG
EDQSVLNYAK GVSGELLNAG VRVHVDDRRD KTPGWKYYHW ELKGVPIRLE VGPSDVKDNA
VTLTRRDTFE KYAVERSNVV DAVRELMKAI EDNMRKSTWE WLRSHVRRSS NVSEAKALLN
EGGVVEVPWS GDDECGRRIM ELTESDALGI PLDTDETPSD LRDAACSEKK AEYWLRLSRR
Y
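
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, the codon table is the standard one (translation table 11 uses the same codon assignments and differs in which codons may initiate), and the code assumes the first codon is the initiator and reads it as M, as the record's protein does for its GTG start.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is the start codon and is read as M,
    whatever amino acid it would otherwise encode.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 18 bases of the gene sequence, as displayed in the record.
print(translate("GTGGTTAGGG GGCCTCAG"))
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field shown in the record.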