Gene Cmaq_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1424 
Symbol 
ID5709294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1503569 
End bp1505908 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content44% 
IMG OID641275934 
Productextracellular solute-binding protein 
Protein accessionYP_001541239 
Protein GI159041987 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0120501 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGG TCATGAAGGC AGTAATACTA CTCCTTGCAG TAACACTACT GGTGACTACC 
GTGGTTTATG CGCAGCAGAC TTTCACAGTT ATTAATGCTG AATCAAGCAC GTACATATAT
GCACCTGGAA TACCAGTGTT CAACCCCTAC ACACCCAGTA ATCTAGTGGG TGTTGTTTCC
ACGTGGGTTC CCCTAGCGTT CTATAACCCA GTTACTAATC ATTTCTGGCC TATTCTTGCT
GAGAATTGGA CTATTCAGGT TTTGCCTAAT GGTTCAGGTA TTTTGACTGT TTACCTTAGG
AGGGGCTTGT ATTGGTTTAA TGGTTCAGCG GTAATGCCTT TCACTGCTTG GGATGTTTAC
ACATACTTCT ATATTGGTGT TAAGGCGTTT AAATGGTACT ACCCATACAT GTTACCTCAA
TATGCTGATG AAGACATTAG GGTATTGGAT AACTATACTA TTCAATTCCT ATTCCAGAAA
TGGAGCCCAA CAGAGTGGAT ATTCTTGCTA GCAAGCCAGA TTAGTACACC ATACTCAGTA
TGGGAACCAA TAGTGGATAA ACTCAAAACA ATGAACGTTA CTCAGACGGC AAAGTACTCA
ACGAACGTAA CGGAGTTTGT ACCACCGTAC TGGGGTTTAA GCCCATACTA CTTAACCTTC
ATTAGCACTA ATACTGTTGT TGAGAAGCTT GAGCCCATGT ACTTTAACGG TAAGCCGTTA
TTAGCCATTT GGGATGAATT ATTCCCATTC AACACATTCA ACTACTACCC TGAAGTAGAG
TCAGTGTACC CTGGTGGTAA TACTCAAGAC TTAGCCCTTG AAATCGCTGG TAAGGCTAAT
TGGGCATACG TTGGCTTATC ATCACAGCAG ATAGAGACAG CAATGCAGCA TGGGTTTAAG
AACATTAACC TTTACGCATG GTCAACCTAC GCTATCGCCA TTAACGCCTA CAACTTCCCA
ACCAATAACA TGTGGCTTAA CCTTAAGTTT AGGCAAGCAT TATTATGGGT CCTTAATAGG
ACTGAAATGG CAGCTGCATG GGGCTTACCT GGTGTACCTA ATTCATCATG GATACTGCCT
CTATGGCGTA ACATACCAAG CCCAGACTAC ATACTCTCCA CGTATCCTCA GCCAATACAG
AACATGGTGT ATGATTACAT GACGTATACC GTTAATTGGA CTAAGGCCGC TGAAATACTT
GAGAGTGCTG GTTTCTATGA GAAGAATGGG CAATGGTATA CTCCAGGTGG TCAACCAATT
AAGATGACGC TAATGATGCC TGCCCAATTC ACTAACCAGG TCGCCGCTGT TTCTAATGCT
GCGGAGCAGT TTAGTATCTT TGGTATACCA ACCACTACAC TTGCTGAGGA TGTTGCAACA
TACTCAAGTC ATATACTGAT AACTGGTGAT TACCAGGCTG CTGATTACTT CGGGCCTGGT
TTAGTAAGTT ACTACACTGG TTGGTCAATG TGGGTTAATC CCTTCGACAG TAACCTATTC
ATAAACACAT CATTGCCTTA CCCATTCCAG TGGCCTAATG GAACATGCAC ACCAGTGACG
TTATCATTAC CCATGCCTAA CTCCACTATA GTTACATGCA TTAACTCAAC CCTCGGTTGG
ATTAACATAA CCAATGCTGA GTACGCCTTC TACTCAGCCG TACCAGGTAG TCCACAGTAT
GAGGAGGCTG TGGAGGTATT GTTTGCATGG TGGAATTACT TTGTGCCTCA ACTAGAGTGG
GGTGAGAAGC TTGAGCCGCA GCAGTGGGAT CCTAATGTAT TTGACTTGGA TTGGGCGTAT
GAATGCTCCA ATGCACCAAC CGTTAATATG GCTATTGGTC CAGCCAACCT AATAGCTCAA
TACGTGGTTA TGCCGCCTCA GCAGGTTTGG GGTGTTCCAG GTGTTTCAGG TGGCGTTTGG
ATGCCTGGTC CATTATACTT CGGTGGTGTT GTTCCTCCTG GCGTAATACC ACCATTAGCG
GAAGCTATGC TTAATGGTTC ACTTTGGACT AAGTACGCCA ATTACGCAAA CTTCCTCGGC
ATAACACCAG GCAGCTTTAA CTTAGCTTGC GTTGCATCGT ACTTCCATAC AATATACACT
CCAGTAACCG CATCAACCAC TACCTCAACA ACCACTGTTA CTTCAACTGC TGTTGCTACT
GTGACTAGTA CTGCCACTGT AACATCAACA AGCACAGTAG TAAGCACAAC AACCACTACG
GCTGTAAGCA CAGTAACAAT CACTAAACCA GTAGTATCAA CAACACTAGT AATAGGAATA
GTAATCATAG TAATAGTCAT AGCAGCAGTA GCAGCAATAA TAGTGTTAAG GAGGAGGTAA
 
Protein sequence
MSGVMKAVIL LLAVTLLVTT VVYAQQTFTV INAESSTYIY APGIPVFNPY TPSNLVGVVS 
TWVPLAFYNP VTNHFWPILA ENWTIQVLPN GSGILTVYLR RGLYWFNGSA VMPFTAWDVY
TYFYIGVKAF KWYYPYMLPQ YADEDIRVLD NYTIQFLFQK WSPTEWIFLL ASQISTPYSV
WEPIVDKLKT MNVTQTAKYS TNVTEFVPPY WGLSPYYLTF ISTNTVVEKL EPMYFNGKPL
LAIWDELFPF NTFNYYPEVE SVYPGGNTQD LALEIAGKAN WAYVGLSSQQ IETAMQHGFK
NINLYAWSTY AIAINAYNFP TNNMWLNLKF RQALLWVLNR TEMAAAWGLP GVPNSSWILP
LWRNIPSPDY ILSTYPQPIQ NMVYDYMTYT VNWTKAAEIL ESAGFYEKNG QWYTPGGQPI
KMTLMMPAQF TNQVAAVSNA AEQFSIFGIP TTTLAEDVAT YSSHILITGD YQAADYFGPG
LVSYYTGWSM WVNPFDSNLF INTSLPYPFQ WPNGTCTPVT LSLPMPNSTI VTCINSTLGW
INITNAEYAF YSAVPGSPQY EEAVEVLFAW WNYFVPQLEW GEKLEPQQWD PNVFDLDWAY
ECSNAPTVNM AIGPANLIAQ YVVMPPQQVW GVPGVSGGVW MPGPLYFGGV VPPGVIPPLA
EAMLNGSLWT KYANYANFLG ITPGSFNLAC VASYFHTIYT PVTASTTTST TTVTSTAVAT
VTSTATVTST STVVSTTTTT AVSTVTITKP VVSTTLVIGI VIIVIVIAAV AAIIVLRRR