Gene Information |
Locus tag | Cmaq_1363 |
Symbol | |
ID | 5710343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1436272 |
End bp | 1437681 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641275871 |
Product | sulfatase |
Protein accession | YP_001541179 |
Protein GI | 159041927 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
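The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cmaq_1363.
length = gene_length_bp(1436272, 1437681)
print(length)                     # 1410
print(protein_length_aa(length))  # 469
```

Both results match the reported Gene Length (1410 bp) and Protein Length (469 aa).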
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0156455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAGC CTGATTTAAA CATAATTGTG ATTGTTTTAG ATAGTCTTAG GCAGGATCAC GTCGGCTTCT ATAGGAGCCT ATATGGTTGG CCCAGGGTTT TTGATAATGT CCCACCACCG GATACGCCTA ACTTGGATAA GTTGGCTTCA GAAGGCATCG TATTCACTAA TGCTTACCCA TCAGGGTTAC CGACAATACC TGTTAGGGAG GAATTATTGA CTGGGCAATT CACGCTACCA TACCACCCAT GGTCACCAAT GCACCCCGAC TCATACACAA TGCCCGAACT ACTGAGGGGT TTAGGCTACT TCACTGGCCT AGTATCAGAC ACCTACCACT TATTTAAGCC AGGCATGAAT TACACTAAGG GCTTTGACAC ATGGTTCTTC ATTAGGGGTC AGGAGTATGA TACATACGGT ATACCGCCTC CGGTTAATAG GCGTGTTGAT GATTATGTTA CTAAGGATTA TTACAGTAAT TACGCTGGTT CAAGGGCTTA TGTTGAGCTT GTTGCCCAAT TCCTAGCCAA TATAGATGAT TGGAGGGATG AGGGTGATTG GTTTGCGGCT AGGGTTTTTA GGACTGCGAT TAATTGGGTT AAGGATGCTT ACAGGAAGTA TTCAAGGTTC ATGCTTTGGA TTGATAGTTT CGACCCACAT GAACCATGGA TCCCGCCGTC AAGGTTCGAT AAGTACACTG ACCCAGGTTA CAAGGGACCT AAATTAATAC TACCCATGGG TGGTGATGCA GCTAAGTGGT ATACCAATGA GCAGGTTAAC TACATTAGGG GTCTTTACGC CGGTGAAGTC GCGTACGTTG ACTACTACTT TGGTGAATTC TATAATGCAT TAAGGGATCT AGGACTCCTT GAACAATCCA TAGTAATACT CCTAGCTGAT CACGGCCACC CACTGGCTGA TCATGGGAAG TTCCTTAAGG GTGGTGATAG ACTTTACAGT GAACTACTAA AGGTACCATT CATGGTTAGG CTACCTAATG GTAGGCACAT TGTTACTGAT GCCATTGTTC AATTCCCAGA TGTCTTACCA ACAATACTTG GTTTACTTAA CCTACCTGAA ACATACACAT ACCCACTTGC CGGTAGGAGC TTCGCGGATT TACTTAATGG TTCATCAAGG GGGCATAGGG CTTACGCAAT AATGGGTTAC CATGAGGCTG CTGATAGGTG TATTAGGGAT GGTGAATGGA GCCTAATCTA TAGGCCTGAT GGTAGGCACG AATTATATAA CCTAGTAAAG GACCCAAGGG AGAGGGTTAA CTTGGCTAGT GAAATGCCTG ATAAGGTTAA TGAAATGATG AGTAAACTAG CGTTATGGTT CATGAATAGG AGTAGGCCAG TGAGGCAGAT ACAGGCTAGG TATGAATTAG GGGGGACTGG TAAGGCTTGA
|
Protein sequence | MSKPDLNIIV IVLDSLRQDH VGFYRSLYGW PRVFDNVPPP DTPNLDKLAS EGIVFTNAYP SGLPTIPVRE ELLTGQFTLP YHPWSPMHPD SYTMPELLRG LGYFTGLVSD TYHLFKPGMN YTKGFDTWFF IRGQEYDTYG IPPPVNRRVD DYVTKDYYSN YAGSRAYVEL VAQFLANIDD WRDEGDWFAA RVFRTAINWV KDAYRKYSRF MLWIDSFDPH EPWIPPSRFD KYTDPGYKGP KLILPMGGDA AKWYTNEQVN YIRGLYAGEV AYVDYYFGEF YNALRDLGLL EQSIVILLAD HGHPLADHGK FLKGGDRLYS ELLKVPFMVR LPNGRHIVTD AIVQFPDVLP TILGLLNLPE TYTYPLAGRS FADLLNGSSR GHRAYAIMGY HEAADRCIRD GEWSLIYRPD GRHELYNLVK DPRERVNLAS EMPDKVNEMM SKLALWFMNR SRPVRQIQAR YELGGTGKA
|
| |
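The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned and sanity-checked; the helper names are hypothetical, the start-codon check is simplified to ATG only (translation table 11 also permits alternative start codons), and the demo string is a toy example rather than the record's sequence.

```python
# Stop codons used by translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, and a table 11 stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same blocked layout as the record (not the real gene).
demo = clean_sequence("ATGAAAGGGC CCTGA")
print(len(demo), round(gc_fraction(demo), 3), looks_like_complete_cds(demo))
```

Applied to the full gene sequence pasted from the record, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.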