Gene Cmaq_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1029 
Symbol 
ID5710164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1081673 
End bp1083148 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content50% 
IMG OID641275529 
Productsugar-binding periplasmic protein 
Protein accessionYP_001540849 
Protein GI159041597 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0247417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCATAG TCATAGTGGC CGTGGTTGTG GTTGTTGTGG TGGTGGGTGG TGTGGTTGCG 
TATTATGTAA CAAGGCCAAC GCCAACGCCA ACGCCCAAGT CAATTCTATT CTATACCTGG
TGGGCCACCG AAGGTAGGGT GGCTGGTGAG CATACCTGGC CCCTGTTTTC GCAGTACTTC
CATATTTCCG TTTCACCGTA TGTTGTTCCC GGCGCTGGCG GTACTGCCGC TAAGTATGCT
ATTATAGCGT TGATAGAGGC TGGTAAGCCT CCAACCGCAT TCCAGAGTCA TGAGGGGCCT
GAGATGGTTA GTTATATTGA GATTGCGCCC CAGGGTGCTA ATTCATTCTA TAACACAACC
TCATACTGGA TGTCATTGGT GACCACGGGT AATGTGTCCA TCCCAGTAAT AGAGGCTGGG
ATGTACAATG GCCATATGTA CCTATTCCCA GTTAATGTAC ATAGGGGTGC GCTCCTATTC
TTTAACCCAC AGGTACTTCG TGAGTATAAC CTACCAATAC CAACCACAAT AGAGCAGTTA
TACTATGATG CCAAGGTTCT ATATCAGCAT GGAATATGCT TCATGATACC GGGCGCGGAC
AGTGGTTGGG ATCAATTCAA CCTATGGGAA AACATATTCC TGGCACTGGG CGGGCCCAAG
TTATACATGG AGTTCCTATA CGGAACCCTT AACTTAAGTG ACCCACAGGT TCAGGAAATA
ATTAACGAGA CCAACACCTG GTTCCTGAAG TTCCAAGCCC TCGACTGCCC AGGCTGGGAG
TCCCTGACCT GGACCCAGGG GCTAGCTTGG GTTATTGAGG GTAAGGCAGC ATTCCAGACA
CTTGGTGATT GGTGGGTTAA CTATGCCTAT GACTTCCTAA ACGCAACAAC ATACCCAGCA
ATACCACCAT ACACCAGCTG GACCAACATA ACAGTCATGA CGGAGCCATT CCCAGGGACA
GCCAATGTAT ATGCGCTGGA CGTTGATGCA GTGGCTGTGC CCGTGAGTCC CGAGGAGAAG
TATGGTGTTT TGTTTGCTGA GTGGTGGAGC TCCTGGTATT GGGGCAGTGG GATACCCGGT
GGCGATCCAT CATGGACCAA GTGGAAGTCA GCCACATTTT ACACCAACAT AACCACGGAC
TACTACAACA CGCCAGAGCA GTGGTGGAGT TATCAGCAAT TGACGAACAT GAGTAAGAAC
CCAAGTGATT GGTGTAACTT TGTTTATCAA CTAAGTGATG GTGGGGTATT CGATGACGTA
TTTGCACAGG TTAATCAAGG ACTACTCACA TACGCCGAGG TGGGTTCTGT GGGTACGTCT
CAATGGATGA GCACCCTGGC CTCGGCATTA GCTGAGGAAA AGGCAGAGTG GTTGAAGGCC
AATAGCCTAG GCCTTGGGTA CCTGGGCTGG CCAGGGCATT ACCTGGGCTG TTACGTACCA
CCGTGGGTTA GCAACACCAA CACCTCGGGT AGCTAG
 
Protein sequence
MLIVIVAVVV VVVVVGGVVA YYVTRPTPTP TPKSILFYTW WATEGRVAGE HTWPLFSQYF 
HISVSPYVVP GAGGTAAKYA IIALIEAGKP PTAFQSHEGP EMVSYIEIAP QGANSFYNTT
SYWMSLVTTG NVSIPVIEAG MYNGHMYLFP VNVHRGALLF FNPQVLREYN LPIPTTIEQL
YYDAKVLYQH GICFMIPGAD SGWDQFNLWE NIFLALGGPK LYMEFLYGTL NLSDPQVQEI
INETNTWFLK FQALDCPGWE SLTWTQGLAW VIEGKAAFQT LGDWWVNYAY DFLNATTYPA
IPPYTSWTNI TVMTEPFPGT ANVYALDVDA VAVPVSPEEK YGVLFAEWWS SWYWGSGIPG
GDPSWTKWKS ATFYTNITTD YYNTPEQWWS YQQLTNMSKN PSDWCNFVYQ LSDGGVFDDV
FAQVNQGLLT YAEVGSVGTS QWMSTLASAL AEEKAEWLKA NSLGLGYLGW PGHYLGCYVP
PWVSNTNTSG S