Gene Cmaq_1891 details

Gene Information

Locus tag: Cmaq_1891
Symbol:
ID: 5710237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1965979
End bp: 1968885
Gene Length: 2907 bp
Protein Length: 968 aa
Translation table: 11
GC content: 45%
IMG OID: 641276399
Product: heparinase II/III family protein
Protein accession: YP_001541698
Protein GI: 159042446
COG category:
COG ID:
TIGRFAM ID:
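
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1965979, 1968885)
print(length_bp)                  # 2907
print(protein_length(length_bp))  # 968
```

Both values agree with the Gene Length and Protein Length fields listed in the record.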


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 36
Fosmid unclonability p-value: 0.857248
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAACCC TGGAGAATAA TAACATTAGA CTCATTATCC ATGAAGCCTC AGGTACAGTA 
GTTGAGCTAC TTGATAAGCG TAGTAATGCT CAACACTTGC TGGCCAGGAA GCCTGAGCTG
GAGCTCATGG AGCCAGGTAT GGGGATACTG GAGATTGAAC CATTCATTAA GAGTAGAAGG
AGCATAGTTA ATTTAAGCGA TGACTCAGTA ACATTGAGGG TTGAGGAGGA GGGTAGGGTT
TTAATTAAGG AGGTGAAGTT AATGAGGCAA GGCGCCTTAA TCACAATTAA GGCCACTGGG
GTGAGTAGGG TTAGGGAATT AATCCACGTG GCCTGCGGTA ACGGTGGCTA CTGGGGTGAG
GCATTGGGTG CAATGTATAA TTGCAGGTAT TTCGTGAAAT TCGGCTTCAC TGAACCACCC
AACAGCTTCT CCTCAGTGGG CCTTAAACCA CCAGTCTCCG GCTTCAGGTT TAGTAAACAC
TCGTACAGTG ATAGGTATTT CCCTGAGCTT AAGTGGATTG CCTTCATTAA TGAGGGTAGG
CTCACCGGCT TACTGGTTAA GTGCCTATCA CCCTGTTACG GTATTGTTGA GGATCAATTC
TTCAACACTG AGCTTAACCT AGTGGCTAAT GGTGAGGGTG AGGTGAGCCT AAAGTATGAG
TTAACGCTGT TTAACGGGTT AAGTAGGGTT GATTACGTTG ATGATGAGTT AATAATAGGG
ATTAACTCAC CCAGTGTAGT TAAGCCTGGG GACACTATTA ATGGTTCATT GAGCGTCTAC
TCGTTAACGG GCAGTGGAGG CTTCAGTATT AATGGTTACG TTAAGTTGGT TAAATCAATG
CCAACACTAG GCAGGAGGGG TTATGATGTT GATAGGGTTA GGGTTGGTGA ATCAAGACTT
GGGTTAACCC TGGAGCATGA TACTATTAAT CTCAAGCCTG GTGAAGTATC AACAGTAGGA
TTCACCACAG AGCCCATGAG GTGGAGCATG GAGGACACCC TATACGAGGT GCCTTACCTT
GAGTTTAACA TTAATGGGAA GGTGGCTTCA AGGGCATTCT CAATTAACCC AGATTACGCA
GCGGCCCTTA ATGCATTAGG GAGAAGGAAT CCCGGGTTGG TGAATCATGT TGGTGATTGG
AGTGATGAGG TTGAGGGATT CTATGATGAT AAGTCAGCCT CAATACCCAT ATATGAGTTG
GCTGCTGAGG ATTTCTCAAC ATCCCGTAGG CTAATTAAGG TGAGGCAATT ACCTGAGTGG
GCTGTCCGGG TTCTTAAGGA GTATTTAAGC GGTGATGTTA AGGTTTACCC AGCGATATTC
CTTGACTTAA GTAAGGCTAC TAGGGATGGT TACGTTACCT CAGCCTTAGC CGACATGATT
CTTAAATCAG CCTCAAGCCA CGTATTCCTA GGTTCACCGA TTAATGATGC CTTAAAGGGT
TTAGAAATGG TGGCTTCAGC GTATGAGAGG GGTGAGTTAA TTCACTGGTT TAATGGTATT
CACGGTGGAG CTGGTTCAGC AGGCATGCTT CAATTAATCC TAGCCTACGA CTTAATTGAG
GATGAGCTCC CTGAGGAATT GAAAACTAGG TTAAAGCTAA TGTTCAGGTG GGCTCAGGGT
GAATTAATTA AATTAACCAA CGCCTGGGCT GGTAATTGGG AATTAACGGA AGCTTTAGCA
TTACTGGCTA TTTCAAGTAA ATTCAACTTC AATAACTCTA AACTAGGGCT CATTAAGGCT
GAATCAGTGT TAAGGAGCAC GTTGAATTAC TTCCTTAATG ATGGTGGTTG GCTTGAGGAG
TCAGCGGGAT ACCATAATGC AGTATTAAAC ATGGTTACCT GGGGGGCTGA GTTACTTAGG
CTTAATGGGA TTGACCTATA CTCAATTACG AGTAATGGTG AACCAGTGAT TAAGAAGGCT
GCGTACTGGC TTTGGAATGT ACTCGACCCA CGCTACAGGA CACCGGCCCT TGAGGATAGT
GGCGATGATA TACCTAACCC AGACCCATTC ATAGTGGGTG GGGTTAGGTA TAATGACCCA
GTGCTCCTTA AAGTTGGTTT AAGGCTTATG GAACTTGGCT CAAGGCCAAC ATCACTATTC
AGCGCATTAG CCTTAGCCGA CGGCCATGAT TTAATTAAAT CCCCCATTGA ACCCAGGCAT
GAACAAGTCA CTGTGCTTGA TGACTCAGGT AGATTCATAG TAAGGAGTAG TGATGAACCT
AACGCAACCT ACTTCATACT TGACTACGGC CCTCACGGTG CATGGCATGG TCACCCAGAT
AAGTTAAGCT TCGAACTACA CTCAAACGGG GAACCACTCA TTGTTGACGC TGGTTCAGGT
GGATACTACT CTGATCTTCA CTGGAAGTGG AGTAGGAGGA GCATAGCTCA CAATACTGTG
ACCCTGGAGG ATAAGGATCA ATTAGAGACT AGGGGGAGGT TAGTGAGGTA TTGGGTTAAT
GGTAATGACG TCTACGCGGT GTTTGAGGCT AATACATACC CAGGTGTTAA TCATAAGAGG
GGGGTGGTTG CCTTAGGTAA ATTAATATAC GTGGTATTAG ACAAGATTAA TGGGGTAGGT
AAATTCAGAT GGAGTATACA CTGCATGGGT GATGTAGTCT ACATGAGGAA GAACAGTATT
GCATTAACCA CAGGGAGCAC TGATTACGTT ATCGCATTAC CAAAGACACC TGAGGTGACG
TATGGGTGGA GAGGCCATAG TATTAGAACC GTGTACATGT ATTACGAGGA TTACAGTGAT
GGTGAGTTAA CCATGTGGGG CATTATCATA CCCTTCAAGG CTGAAGTAAG CTTTAACGGT
AGTGAAGTAG TCATAAGTAA TGGGGGCCTT AACTATATAG TGAGACCGCT TGAGCTCTAT
AATTCATTAT TCAATAACTT ATATTGA
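
The GC content reported for this gene can be recomputed from the sequence as listed, once the spaces that split it into 10-base blocks are removed. The following is a rough Python sketch, assuming the sequence contains only the letters A, C, G and T. The helper names are hypothetical, and the short input in the example is a toy string, not the gene itself.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input: 6 of the 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full block under "Gene sequence" to `gc_fraction` and rounding to a whole percentage gives a value that can be compared with the GC content field of the record.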
 
Protein sequence
MITLENNNIR LIIHEASGTV VELLDKRSNA QHLLARKPEL ELMEPGMGIL EIEPFIKSRR 
SIVNLSDDSV TLRVEEEGRV LIKEVKLMRQ GALITIKATG VSRVRELIHV ACGNGGYWGE
ALGAMYNCRY FVKFGFTEPP NSFSSVGLKP PVSGFRFSKH SYSDRYFPEL KWIAFINEGR
LTGLLVKCLS PCYGIVEDQF FNTELNLVAN GEGEVSLKYE LTLFNGLSRV DYVDDELIIG
INSPSVVKPG DTINGSLSVY SLTGSGGFSI NGYVKLVKSM PTLGRRGYDV DRVRVGESRL
GLTLEHDTIN LKPGEVSTVG FTTEPMRWSM EDTLYEVPYL EFNINGKVAS RAFSINPDYA
AALNALGRRN PGLVNHVGDW SDEVEGFYDD KSASIPIYEL AAEDFSTSRR LIKVRQLPEW
AVRVLKEYLS GDVKVYPAIF LDLSKATRDG YVTSALADMI LKSASSHVFL GSPINDALKG
LEMVASAYER GELIHWFNGI HGGAGSAGML QLILAYDLIE DELPEELKTR LKLMFRWAQG
ELIKLTNAWA GNWELTEALA LLAISSKFNF NNSKLGLIKA ESVLRSTLNY FLNDGGWLEE
SAGYHNAVLN MVTWGAELLR LNGIDLYSIT SNGEPVIKKA AYWLWNVLDP RYRTPALEDS
GDDIPNPDPF IVGGVRYNDP VLLKVGLRLM ELGSRPTSLF SALALADGHD LIKSPIEPRH
EQVTVLDDSG RFIVRSSDEP NATYFILDYG PHGAWHGHPD KLSFELHSNG EPLIVDAGSG
GYYSDLHWKW SRRSIAHNTV TLEDKDQLET RGRLVRYWVN GNDVYAVFEA NTYPGVNHKR
GVVALGKLIY VVLDKINGVG KFRWSIHCMG DVVYMRKNSI ALTTGSTDYV IALPKTPEVT
YGWRGHSIRT VYMYYEDYSD GELTMWGIII PFKAEVSFNG SEVVISNGGL NYIVRPLELY
NSLFNNLY
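
The protein sequence should be the translation of the gene sequence under translation table 11. The following is a minimal Python sketch of that check. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (the two differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame DNA sequence, ignoring spaces and line breaks."""
    seq = "".join(blocked_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGATAACCC TGGAGAATAA TAACATTAGA"))  # MITLENNNIR
print(translate("AATTCATTAT TCAATAACTT ATATTGA"))     # NSLFNNLY*
```

The two fragments reproduce the first ten and last eight residues of the listed protein, with the trailing `*` marking the TGA stop codon that is excluded from the protein sequence.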