Gene Cmaq_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1035 
Symbol 
ID5708940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1086633 
End bp1087742 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content47% 
IMG OID641275534 
Productsaccharopine dehydrogenase 
Protein accessionYP_001540854 
Protein GI159041602 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.002622 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGGTTGA TTACCGTAGT GGGTTTGGGT CCAATGGGTA GGGCTGCCTC ATACTACTTG 
ATTAAGCACA CTAATCATGA GGTATTGGGT TACGATAAAT CCAGTGAGGC GGTTAATGCT
GCATTAAGCC TGGGGATTAA TGCCGTTAAG GCTGATGCCA TGAATGATGA GGTCGCCAGG
AGGATTGCCG GTGATTCTGA TGTAGTATTA ACCGCTGTGC CACAGTCCAT TGCCGATTCA
CTGGTTTTTA AGCTTCATGA ATATGGGGCT AAGGTTATTG ATTTAATATT CCTATGGAGG
TTTAATAGTG AAACCGCTGG GAGGGTTAGT AATGGATCCC TTGTAATACC GGCCTGCGGT
TGGGCTCCAG GCTTAACCAA CCTACTTGCA GCTGCAGCAT CCTCAGAGTT GGATGCTGTT
GAGGAGTTGG GGATTCATGT TGGTGGTAAT CCAGTGAACC CAAGGCCACC ACTGTACTAT
GATGTGTTAT TCTCCGTTGA AAGCACCATT GAGGAGTACG TGAGGCCGGC CACAATTATG
ATTAACGGTG AGTTAAAGAG TGTTGACCCA TTATCATCAA TATACCCGTT TAGGACCTGG
CTCATTGACG GTGACTTCTC TGAATTCTAC ACCGACGGTT TATCAACACT AATCGTCACC
ATGCCTAGGC GGTTTAAGTC AATTAAAACA ATGTATGAGA GAACCATTAG GTGGAGTAGG
CACCTTGAGG TTATGAGGAT TCTTAAGGAT GTTGGCTTAC TGGGTAGTGA GGCATTGGTT
AAATCCTTAA GCAGCATAAT GAGGCAGGGT GTTGAGGATT TCTCACTAAC TGTGGTGGAG
GCTAGGGGTG TGATTAATGG TGAGCCTGCC AGGGTTGCCT TTGAGGGTAT TGATTACGCA
AGGGGTGGCT TCACGTCAAT GGCTAGGTTA ACTGGGTTCA CAGCAGCCGT GGTAGCTGAC
TTAGTGGCAA GGGGTGTGAT TAAGGGTAGT GGCTTGCTCC CAATTGAGGA GGCTTATTTT
GAGAATAAGG ATGTATTGAG GCATGTTCTC AATGGACTTA AGGCTGAGGG CGTGAAGCTT
ATGCTCACTA AAACAGTAAC CGCACCCTAG
 
Protein sequence
MGLITVVGLG PMGRAASYYL IKHTNHEVLG YDKSSEAVNA ALSLGINAVK ADAMNDEVAR 
RIAGDSDVVL TAVPQSIADS LVFKLHEYGA KVIDLIFLWR FNSETAGRVS NGSLVIPACG
WAPGLTNLLA AAASSELDAV EELGIHVGGN PVNPRPPLYY DVLFSVESTI EEYVRPATIM
INGELKSVDP LSSIYPFRTW LIDGDFSEFY TDGLSTLIVT MPRRFKSIKT MYERTIRWSR
HLEVMRILKD VGLLGSEALV KSLSSIMRQG VEDFSLTVVE ARGVINGEPA RVAFEGIDYA
RGGFTSMARL TGFTAAVVAD LVARGVIKGS GLLPIEEAYF ENKDVLRHVL NGLKAEGVKL
MLTKTVTAP