Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_0464 |
Symbol | |
ID | 5708785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 503032 |
End bp | 504303 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641274967 |
Product | solute binding protein-like protein |
Protein accession | YP_001540299 |
Protein GI | 159041047 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.153689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGCAA TAGGTAAGTG GGTTACGTTC TTATTAGCAA TAATGATCGC AGCCAGTCTC CTAGTTACAT ATGCCCAGCA AGTGGTGCCG CTGGTTCCAG CCTTTAACAC TACGTCAAAC GTAGTGGTTA TTGGCAATAT GTCTAGTCAA TACGTCTCAA AGCTTCTTGG TTACGCCTTA TTCTTTACTA ATCTTTGGGA CATGGGCTCA GGTACCTCGG GTTCAACTAT CTTAATGTAT AATGCTGCTG AGGGAGTGGC ATATTATATA CGGAATTACA CCTTCATTAA GCCTACATAT TCATACGATG TCACTCTTGG TTACCCTGCA ATATTCTATG GTCATTCACA GTGGGGTGGC TTATATCCTA AGATGAGTCC ATTACTAATG CTTCCAGCTC AAGTCATATC GCTTCCAGGT ATTTGGGCTT ACGTGAACTA CAGTACGGTT AAGTTCACTC CCGGTAACCT GCAGTTTGAC TTATCACTAC AGGTTTGGCT GGGGGCAACG CCTAATGAGA CCAGTGGAGC CCAGCCAGGT GATTTAGAGG TGATAATATA CTACTATACG CATAATATGG CCCCGGGTGG TTCAAATATG GGTACTATAA CTGTGCCTAC GTTTGTTAAC GGTAGTATTG TTGATGAGAG TTGGCAGGTT TGGGTTTACT ACGGTGGATC AAGCATGTGG ACTATAGCTT GGTTTGTACC AAGTATTAAT CAGCCAAGCG GCTACGTTGG ATTAAACATA ACTGGTATGC TCTACGACTT ATTCAATATA CTAGTTACCA ATTGGCCGCT GCAGTGGAAC TTCACTGGGT TACTGAACTA CTACTTATTC CAGCTTGGTA ACGACTTCGG TACATCAAGT CAATTAAGTA ACGTCTTCGA GAACATAACA GTATATAAGT ATTACCTTGA GTTAACTAAG CCATTAATAC CGATAATCAC GGTAACCAAC ACAACAACAA CCACAGTCAC CACCACGGCA ACAACCTACG TAACATCAAC AGTAACCAGC ACAACCACTA CCACTGTAAC CTCAACCACA ACCTCAGTTA GCACTACCAC TGTTACTTCA CCTGTGACTA GTACGACAAC GTTAACAACC ACATCACTCT TAACCACCAC TAGTACCGCA ACATTGGTAA GCACTTTAAC CAGTACCTTA ACAGTTACTA AGGAGGTTAT TGGGACAAGC GTAATAGTGG GCATAGTAAT CATAGTAATC GTAATAGCAG TCATTGCGGC GCTGCTGGTT AGGAGGAGGT AG
|
Protein sequence | MPAIGKWVTF LLAIMIAASL LVTYAQQVVP LVPAFNTTSN VVVIGNMSSQ YVSKLLGYAL FFTNLWDMGS GTSGSTILMY NAAEGVAYYI RNYTFIKPTY SYDVTLGYPA IFYGHSQWGG LYPKMSPLLM LPAQVISLPG IWAYVNYSTV KFTPGNLQFD LSLQVWLGAT PNETSGAQPG DLEVIIYYYT HNMAPGGSNM GTITVPTFVN GSIVDESWQV WVYYGGSSMW TIAWFVPSIN QPSGYVGLNI TGMLYDLFNI LVTNWPLQWN FTGLLNYYLF QLGNDFGTSS QLSNVFENIT VYKYYLELTK PLIPIITVTN TTTTTVTTTA TTYVTSTVTS TTTTTVTSTT TSVSTTTVTS PVTSTTTLTT TSLLTTTSTA TLVSTLTSTL TVTKEVIGTS VIVGIVIIVI VIAVIAALLV RRR
|
| |