Gene Cmaq_1363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1363 
Symbol 
ID5710343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1436272 
End bp1437681 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content44% 
IMG OID641275871 
Productsulfatase 
Protein accessionYP_001541179 
Protein GI159041927 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0156455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGC CTGATTTAAA CATAATTGTG ATTGTTTTAG ATAGTCTTAG GCAGGATCAC 
GTCGGCTTCT ATAGGAGCCT ATATGGTTGG CCCAGGGTTT TTGATAATGT CCCACCACCG
GATACGCCTA ACTTGGATAA GTTGGCTTCA GAAGGCATCG TATTCACTAA TGCTTACCCA
TCAGGGTTAC CGACAATACC TGTTAGGGAG GAATTATTGA CTGGGCAATT CACGCTACCA
TACCACCCAT GGTCACCAAT GCACCCCGAC TCATACACAA TGCCCGAACT ACTGAGGGGT
TTAGGCTACT TCACTGGCCT AGTATCAGAC ACCTACCACT TATTTAAGCC AGGCATGAAT
TACACTAAGG GCTTTGACAC ATGGTTCTTC ATTAGGGGTC AGGAGTATGA TACATACGGT
ATACCGCCTC CGGTTAATAG GCGTGTTGAT GATTATGTTA CTAAGGATTA TTACAGTAAT
TACGCTGGTT CAAGGGCTTA TGTTGAGCTT GTTGCCCAAT TCCTAGCCAA TATAGATGAT
TGGAGGGATG AGGGTGATTG GTTTGCGGCT AGGGTTTTTA GGACTGCGAT TAATTGGGTT
AAGGATGCTT ACAGGAAGTA TTCAAGGTTC ATGCTTTGGA TTGATAGTTT CGACCCACAT
GAACCATGGA TCCCGCCGTC AAGGTTCGAT AAGTACACTG ACCCAGGTTA CAAGGGACCT
AAATTAATAC TACCCATGGG TGGTGATGCA GCTAAGTGGT ATACCAATGA GCAGGTTAAC
TACATTAGGG GTCTTTACGC CGGTGAAGTC GCGTACGTTG ACTACTACTT TGGTGAATTC
TATAATGCAT TAAGGGATCT AGGACTCCTT GAACAATCCA TAGTAATACT CCTAGCTGAT
CACGGCCACC CACTGGCTGA TCATGGGAAG TTCCTTAAGG GTGGTGATAG ACTTTACAGT
GAACTACTAA AGGTACCATT CATGGTTAGG CTACCTAATG GTAGGCACAT TGTTACTGAT
GCCATTGTTC AATTCCCAGA TGTCTTACCA ACAATACTTG GTTTACTTAA CCTACCTGAA
ACATACACAT ACCCACTTGC CGGTAGGAGC TTCGCGGATT TACTTAATGG TTCATCAAGG
GGGCATAGGG CTTACGCAAT AATGGGTTAC CATGAGGCTG CTGATAGGTG TATTAGGGAT
GGTGAATGGA GCCTAATCTA TAGGCCTGAT GGTAGGCACG AATTATATAA CCTAGTAAAG
GACCCAAGGG AGAGGGTTAA CTTGGCTAGT GAAATGCCTG ATAAGGTTAA TGAAATGATG
AGTAAACTAG CGTTATGGTT CATGAATAGG AGTAGGCCAG TGAGGCAGAT ACAGGCTAGG
TATGAATTAG GGGGGACTGG TAAGGCTTGA
 
Protein sequence
MSKPDLNIIV IVLDSLRQDH VGFYRSLYGW PRVFDNVPPP DTPNLDKLAS EGIVFTNAYP 
SGLPTIPVRE ELLTGQFTLP YHPWSPMHPD SYTMPELLRG LGYFTGLVSD TYHLFKPGMN
YTKGFDTWFF IRGQEYDTYG IPPPVNRRVD DYVTKDYYSN YAGSRAYVEL VAQFLANIDD
WRDEGDWFAA RVFRTAINWV KDAYRKYSRF MLWIDSFDPH EPWIPPSRFD KYTDPGYKGP
KLILPMGGDA AKWYTNEQVN YIRGLYAGEV AYVDYYFGEF YNALRDLGLL EQSIVILLAD
HGHPLADHGK FLKGGDRLYS ELLKVPFMVR LPNGRHIVTD AIVQFPDVLP TILGLLNLPE
TYTYPLAGRS FADLLNGSSR GHRAYAIMGY HEAADRCIRD GEWSLIYRPD GRHELYNLVK
DPRERVNLAS EMPDKVNEMM SKLALWFMNR SRPVRQIQAR YELGGTGKA