Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1891 |
Symbol | |
ID | 5710237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 1965979 |
End bp | 1968885 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641276399 |
Product | heparinase II/III family protein |
Protein accession | YP_001541698 |
Protein GI | 159042446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.857248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAACCC TGGAGAATAA TAACATTAGA CTCATTATCC ATGAAGCCTC AGGTACAGTA GTTGAGCTAC TTGATAAGCG TAGTAATGCT CAACACTTGC TGGCCAGGAA GCCTGAGCTG GAGCTCATGG AGCCAGGTAT GGGGATACTG GAGATTGAAC CATTCATTAA GAGTAGAAGG AGCATAGTTA ATTTAAGCGA TGACTCAGTA ACATTGAGGG TTGAGGAGGA GGGTAGGGTT TTAATTAAGG AGGTGAAGTT AATGAGGCAA GGCGCCTTAA TCACAATTAA GGCCACTGGG GTGAGTAGGG TTAGGGAATT AATCCACGTG GCCTGCGGTA ACGGTGGCTA CTGGGGTGAG GCATTGGGTG CAATGTATAA TTGCAGGTAT TTCGTGAAAT TCGGCTTCAC TGAACCACCC AACAGCTTCT CCTCAGTGGG CCTTAAACCA CCAGTCTCCG GCTTCAGGTT TAGTAAACAC TCGTACAGTG ATAGGTATTT CCCTGAGCTT AAGTGGATTG CCTTCATTAA TGAGGGTAGG CTCACCGGCT TACTGGTTAA GTGCCTATCA CCCTGTTACG GTATTGTTGA GGATCAATTC TTCAACACTG AGCTTAACCT AGTGGCTAAT GGTGAGGGTG AGGTGAGCCT AAAGTATGAG TTAACGCTGT TTAACGGGTT AAGTAGGGTT GATTACGTTG ATGATGAGTT AATAATAGGG ATTAACTCAC CCAGTGTAGT TAAGCCTGGG GACACTATTA ATGGTTCATT GAGCGTCTAC TCGTTAACGG GCAGTGGAGG CTTCAGTATT AATGGTTACG TTAAGTTGGT TAAATCAATG CCAACACTAG GCAGGAGGGG TTATGATGTT GATAGGGTTA GGGTTGGTGA ATCAAGACTT GGGTTAACCC TGGAGCATGA TACTATTAAT CTCAAGCCTG GTGAAGTATC AACAGTAGGA TTCACCACAG AGCCCATGAG GTGGAGCATG GAGGACACCC TATACGAGGT GCCTTACCTT GAGTTTAACA TTAATGGGAA GGTGGCTTCA AGGGCATTCT CAATTAACCC AGATTACGCA GCGGCCCTTA ATGCATTAGG GAGAAGGAAT CCCGGGTTGG TGAATCATGT TGGTGATTGG AGTGATGAGG TTGAGGGATT CTATGATGAT AAGTCAGCCT CAATACCCAT ATATGAGTTG GCTGCTGAGG ATTTCTCAAC ATCCCGTAGG CTAATTAAGG TGAGGCAATT ACCTGAGTGG GCTGTCCGGG TTCTTAAGGA GTATTTAAGC GGTGATGTTA AGGTTTACCC AGCGATATTC CTTGACTTAA GTAAGGCTAC TAGGGATGGT TACGTTACCT CAGCCTTAGC CGACATGATT CTTAAATCAG CCTCAAGCCA CGTATTCCTA GGTTCACCGA TTAATGATGC CTTAAAGGGT TTAGAAATGG TGGCTTCAGC GTATGAGAGG GGTGAGTTAA TTCACTGGTT TAATGGTATT CACGGTGGAG CTGGTTCAGC AGGCATGCTT CAATTAATCC TAGCCTACGA CTTAATTGAG GATGAGCTCC CTGAGGAATT GAAAACTAGG TTAAAGCTAA TGTTCAGGTG GGCTCAGGGT GAATTAATTA AATTAACCAA CGCCTGGGCT GGTAATTGGG AATTAACGGA AGCTTTAGCA TTACTGGCTA TTTCAAGTAA ATTCAACTTC AATAACTCTA AACTAGGGCT CATTAAGGCT GAATCAGTGT TAAGGAGCAC GTTGAATTAC TTCCTTAATG ATGGTGGTTG GCTTGAGGAG TCAGCGGGAT ACCATAATGC AGTATTAAAC ATGGTTACCT GGGGGGCTGA GTTACTTAGG CTTAATGGGA TTGACCTATA CTCAATTACG AGTAATGGTG AACCAGTGAT TAAGAAGGCT GCGTACTGGC TTTGGAATGT ACTCGACCCA CGCTACAGGA CACCGGCCCT TGAGGATAGT GGCGATGATA TACCTAACCC AGACCCATTC ATAGTGGGTG GGGTTAGGTA TAATGACCCA GTGCTCCTTA AAGTTGGTTT AAGGCTTATG GAACTTGGCT CAAGGCCAAC ATCACTATTC AGCGCATTAG CCTTAGCCGA CGGCCATGAT TTAATTAAAT CCCCCATTGA ACCCAGGCAT GAACAAGTCA CTGTGCTTGA TGACTCAGGT AGATTCATAG TAAGGAGTAG TGATGAACCT AACGCAACCT ACTTCATACT TGACTACGGC CCTCACGGTG CATGGCATGG TCACCCAGAT AAGTTAAGCT TCGAACTACA CTCAAACGGG GAACCACTCA TTGTTGACGC TGGTTCAGGT GGATACTACT CTGATCTTCA CTGGAAGTGG AGTAGGAGGA GCATAGCTCA CAATACTGTG ACCCTGGAGG ATAAGGATCA ATTAGAGACT AGGGGGAGGT TAGTGAGGTA TTGGGTTAAT GGTAATGACG TCTACGCGGT GTTTGAGGCT AATACATACC CAGGTGTTAA TCATAAGAGG GGGGTGGTTG CCTTAGGTAA ATTAATATAC GTGGTATTAG ACAAGATTAA TGGGGTAGGT AAATTCAGAT GGAGTATACA CTGCATGGGT GATGTAGTCT ACATGAGGAA GAACAGTATT GCATTAACCA CAGGGAGCAC TGATTACGTT ATCGCATTAC CAAAGACACC TGAGGTGACG TATGGGTGGA GAGGCCATAG TATTAGAACC GTGTACATGT ATTACGAGGA TTACAGTGAT GGTGAGTTAA CCATGTGGGG CATTATCATA CCCTTCAAGG CTGAAGTAAG CTTTAACGGT AGTGAAGTAG TCATAAGTAA TGGGGGCCTT AACTATATAG TGAGACCGCT TGAGCTCTAT AATTCATTAT TCAATAACTT ATATTGA
|
Protein sequence | MITLENNNIR LIIHEASGTV VELLDKRSNA QHLLARKPEL ELMEPGMGIL EIEPFIKSRR SIVNLSDDSV TLRVEEEGRV LIKEVKLMRQ GALITIKATG VSRVRELIHV ACGNGGYWGE ALGAMYNCRY FVKFGFTEPP NSFSSVGLKP PVSGFRFSKH SYSDRYFPEL KWIAFINEGR LTGLLVKCLS PCYGIVEDQF FNTELNLVAN GEGEVSLKYE LTLFNGLSRV DYVDDELIIG INSPSVVKPG DTINGSLSVY SLTGSGGFSI NGYVKLVKSM PTLGRRGYDV DRVRVGESRL GLTLEHDTIN LKPGEVSTVG FTTEPMRWSM EDTLYEVPYL EFNINGKVAS RAFSINPDYA AALNALGRRN PGLVNHVGDW SDEVEGFYDD KSASIPIYEL AAEDFSTSRR LIKVRQLPEW AVRVLKEYLS GDVKVYPAIF LDLSKATRDG YVTSALADMI LKSASSHVFL GSPINDALKG LEMVASAYER GELIHWFNGI HGGAGSAGML QLILAYDLIE DELPEELKTR LKLMFRWAQG ELIKLTNAWA GNWELTEALA LLAISSKFNF NNSKLGLIKA ESVLRSTLNY FLNDGGWLEE SAGYHNAVLN MVTWGAELLR LNGIDLYSIT SNGEPVIKKA AYWLWNVLDP RYRTPALEDS GDDIPNPDPF IVGGVRYNDP VLLKVGLRLM ELGSRPTSLF SALALADGHD LIKSPIEPRH EQVTVLDDSG RFIVRSSDEP NATYFILDYG PHGAWHGHPD KLSFELHSNG EPLIVDAGSG GYYSDLHWKW SRRSIAHNTV TLEDKDQLET RGRLVRYWVN GNDVYAVFEA NTYPGVNHKR GVVALGKLIY VVLDKINGVG KFRWSIHCMG DVVYMRKNSI ALTTGSTDYV IALPKTPEVT YGWRGHSIRT VYMYYEDYSD GELTMWGIII PFKAEVSFNG SEVVISNGGL NYIVRPLELY NSLFNNLY
|
| |