Gene Cmaq_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1011 
Symbol 
ID5709407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1061375 
End bp1063120 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content45% 
IMG OID641275512 
Productglycoside hydrolase family protein 
Protein accessionYP_001540832 
Protein GI159041580 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.956563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.837929 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATTA GTAATAAGCT TGAAATCTCA AGTAACATAA CATCAATAAT AATGAATAAC 
ACGGGTATAG GGTTAAGATT ATTCCACGGG GAGTACGTTG AGGGTGAGTT AAGAGGTAGT
GGTGCTGCAT TACTTAGGAG TTATTGGGAT TATGGGGGAA TTAGGAGGTA TTGGGGTAAT
GGTGCCTTCG AATTCACCTC AATATACGGT GACTTAGCAG TCTACGTGAT AAGAGGCTAT
AGGGGTGTTA TTCAAGGGGA GTTTAAGGTA AGGGGCTTCA GTGGGGTTGA TGAATTAGGG
GATGGTTCAA TTAACGTTAA GCATGAGGGT GGTTTAATGA GAATTAAGGC TACTCAACCA
ATTAACCTAA GGGTAATGAA TAATGGATTC TCATTAACAA TTAACGTTAA TGATGAATTA
AAGGTGGCTG CAGCTGGAGG TGGACAGGTG AATGATGTGG ATAAGGTGCT TAACGATGAA
TCAATCATTG AGTCTAAGAG GAGACTATGG TTAAGCACCC TAATGAGCAA TATCAATGGG
AGTGATTTAA TTAAACTATG CTGGTACGTG ATATTAACTA ATAGGTGCAG TGTACCTAAT
CACCCGGCTT TAAGGAAGCC GTTCAACATG CCCAGTAAGT ACGTTTTCAG GCATCAATGG
CTCTGGGACT CCTCATTCCA CTCAATAGTC CTAAGGCATT ACGACGTTAA CATGGCTATG
GAGGAGTTGG AGAACCTAAT CCTGAATCAG AAGCCTGACG GTAGGATTCC GCACGAAATA
TTCATGTCGA AGGAGAGCTG TAAATCCTTC TGGGGTATTG ATGACTACTC ACCGTGGACA
ACCCAACCAC CTGTATTAGC GGTGGCCATT GATAAGGTTC TATCAGTGAG GTGGAATGAT
GAATTCGCCG AGAAGGCCTT TAATGCATTA ACCAAGTATG ATGAATGGTT TAGGAGTCAA
AGGGACAGGG ATTCAGATCA CTTATACGCC TACTTCGATC CACTGGAGAG TGGGTGGGAT
AATAGTCCCA GGTGGGATGA GGCCATTAGG AGGTTTAGGG AGAATCCGCA GCGTTACGAG
GTGTATGGGA AATTAACCAT GACTCCAGTT GAGGCTGTTG ACTTAAATAG CCTAATTTAC
CTTCAGAGGA GGGTTATCGC TAAGTTGGCT GAAAGGCTTG GTGAGGTTAA TGTTGCCGAA
CACTATGATG AGATGGCTGA TGAGACCGCT AAGGCGGTTA GGAGGATTAT GTGGAGTGAG
AAGGATGGTT TCTTCTATGA CGTGTATGAG GAGGGCCATG AATTAATCAA GGTTAAGACA
CCTGCAGCCT TCCTAACAAT GTTCACTGGA ATAGCTACGG GTGAGCAGGC TGAGAGACTG
GTGGCGCATT TGCTTAATCC AAGGGAATTC TGGACTACAT TCCCGCTACC AAGCGTGAGC
GCTGATGAAT CAACCTATGA TCCAACAGGC TACTGGAGGG GTAGGTCATG GATTAACCTA
GTGTGGTTCA CGTACCATGG GTTGAGGAAT TACGGTTACT ATGAGGAGGC CTCCAGGTTA
CTTAATAAAG TCCTTGAAGT AATGGGTAGG TCAATGACCT GTAATGAGAA TTACAATAGT
AGCACCGGGG AACCAATGGG TGCCCCTGAC TTCGGCTGGA CCACACTGAT AATAGACATG
GTTGCGAGTG AACTGGGTAA GGAGTCCCCT GGGGCTGCGT TTCATTATGG TACTTTACTT
AGTTAG
 
Protein sequence
MRISNKLEIS SNITSIIMNN TGIGLRLFHG EYVEGELRGS GAALLRSYWD YGGIRRYWGN 
GAFEFTSIYG DLAVYVIRGY RGVIQGEFKV RGFSGVDELG DGSINVKHEG GLMRIKATQP
INLRVMNNGF SLTINVNDEL KVAAAGGGQV NDVDKVLNDE SIIESKRRLW LSTLMSNING
SDLIKLCWYV ILTNRCSVPN HPALRKPFNM PSKYVFRHQW LWDSSFHSIV LRHYDVNMAM
EELENLILNQ KPDGRIPHEI FMSKESCKSF WGIDDYSPWT TQPPVLAVAI DKVLSVRWND
EFAEKAFNAL TKYDEWFRSQ RDRDSDHLYA YFDPLESGWD NSPRWDEAIR RFRENPQRYE
VYGKLTMTPV EAVDLNSLIY LQRRVIAKLA ERLGEVNVAE HYDEMADETA KAVRRIMWSE
KDGFFYDVYE EGHELIKVKT PAAFLTMFTG IATGEQAERL VAHLLNPREF WTTFPLPSVS
ADESTYDPTG YWRGRSWINL VWFTYHGLRN YGYYEEASRL LNKVLEVMGR SMTCNENYNS
STGEPMGAPD FGWTTLIIDM VASELGKESP GAAFHYGTLL S