Gene Maqu_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMaqu_3103 
Symbol 
ID4657153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinobacter aquaeolei VT8 
KingdomBacteria 
Replicon accessionNC_008740 
Strand
Start bp3446723 
End bp3448411 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content59% 
IMG OID639813083 
ProductHAD family hydrolase 
Protein accessionYP_960364 
Protein GI120556013 
COG category[R] General function prediction only 
COG ID[COG0546] Predicted phosphatases
[COG0637] Predicted phosphatase/phosphohexomutase 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGAT TCGACGACAA CATCCTGCCA CCCTCGGTCC TGATCTTCGA CTGGCACGGT 
ACCCTGGTGG ACACCCATGA TGCCATGTTC AGCGCCATGG AGGAGATGTT ACCGCAACTG
GAAGAGCTGG GCCTCGTGGA GCAGTTGCTG CCTGAAGACC AGTGCCGCAC CGAAGACGAT
GCCCGGTTGG TGCGTTACAT TCGCATCTAT CGCCGATTGC ACCCGCGGAT TCTGGCTGAG
CGCCGGGTCT CCCGAACGGA TATTTTCAAC GCCATCTTTG GTGAGAACAA GGCTGCCAAG
CTGATCGCCC ATAAAGCCTA CAACCAGGCC TACCGGAAGT ACTTCGGCCA GGTGAAACCG
TTTCAGCCGG GCGCTTATGA GTATCTGTCG GCCCTGAAAG CCCTGGGCAT CAAGCTGGCG
GTGTGCACCA ACCGCAACCG GGAGTTCCTG GACAGGGAGT TGCAGATTGT CGACGAAGGC
CGCTGGCTGC ATTTGTTCGA TGCCACCGTG TGTGCTGATG ACGTCACTGA ATACAAGCCG
GATCCGGAGG TGATTCTCAA AGCGCTGGAA AAGCTGTCGA TCCAGGCGGA TGACCACGCC
TGGTATATCG GGGACAGCTA TGTGGACATG CTTACCGCAC ACCATGCCGG TGTGGCTGGA
GTGTTCTACA ACGGTGCCTG CTGGGAGCCC GACCGGGTCA GCGGCTGGTT CACCAAGCGA
GACGCCCCGA CCGCTGTTCT CGACAGCTTT GAAGATCTGA TGGATTTACT TGCCCTGCTG
GAACGTGAGC ATTCCGAAGC GTTTACCTCC AGCCCGGCAG AGGTGCGCCC GAAACCTTTC
CCGGCGCCAG AGCGGCCGGA GCCCCGCATC GAGCCTGACT GGCACCCGGC GGTGGTGAAA
CTGATTCGCC CGGCGGTGAT CCTGTTCGAC TGGCACGCCA CTCTGGTGGA CACGCTCGAT
GCCATGTACC ACGCGGTGGA TGACATGCTG CCGGATTTCC ACAAGCTGGG ATTGATGGAT
CGGATGGTGG CGCCGGAAGA CAGCAAAACC CCGGAAGATG CCCGCCTGGT GGCATACGTG
CGAGAGTTCG CCAAACTGCA CCCCAAGGTA AAGGCTGATC GCAAAATCTC CCGCACCGAT
ATTTTTGAAG TGCTATTCGG TGAGGATCAG GACGCCAAGC AGATTGCCCA CAAAGCCTTT
AATCATCACT ACCGGAATCA CTACGGCACC GTGAAAGCCT TCGAGCCCAG GGTGCGGGAA
ATGCTGGAAG GGCTACGCAA GCTGGGTATC CAGGTGGGCG TGATCACCAA CCGCGACCGG
GAGTTTTTCG AGCACGAACT GGCGGCGGTG GAAGATACCG GCTGGACACA CCTGTTTGAC
GTTAACGTGT GTGGTGATGA TACCCCGCTG CGCAAACCCC ACCCGGATCA ATTGCTGCTG
GCAGTGCAGA AGCTGGATTA CCCACCGGAC CCCAGTGTGT GGTACGTGGG CGACTCTACC
ACTGACATCA TTGCGGCTAA ACGGGCCGGT ATGACCGCGG TGTTCTTCAA CGGCGCCCAA
TGGGATTTGC CGTGGCTGAA CCGGATTTTT CCCGGTACTC ACAAACACCC GGACAAACCG
GATGTAGTGG TTAACGATTT CTCCGAGTTC TGGGCTCTGG TGCTGGCCTG CGAAGTCGGG
CCGCCCTGA
 
Protein sequence
MSRFDDNILP PSVLIFDWHG TLVDTHDAMF SAMEEMLPQL EELGLVEQLL PEDQCRTEDD 
ARLVRYIRIY RRLHPRILAE RRVSRTDIFN AIFGENKAAK LIAHKAYNQA YRKYFGQVKP
FQPGAYEYLS ALKALGIKLA VCTNRNREFL DRELQIVDEG RWLHLFDATV CADDVTEYKP
DPEVILKALE KLSIQADDHA WYIGDSYVDM LTAHHAGVAG VFYNGACWEP DRVSGWFTKR
DAPTAVLDSF EDLMDLLALL EREHSEAFTS SPAEVRPKPF PAPERPEPRI EPDWHPAVVK
LIRPAVILFD WHATLVDTLD AMYHAVDDML PDFHKLGLMD RMVAPEDSKT PEDARLVAYV
REFAKLHPKV KADRKISRTD IFEVLFGEDQ DAKQIAHKAF NHHYRNHYGT VKAFEPRVRE
MLEGLRKLGI QVGVITNRDR EFFEHELAAV EDTGWTHLFD VNVCGDDTPL RKPHPDQLLL
AVQKLDYPPD PSVWYVGDST TDIIAAKRAG MTAVFFNGAQ WDLPWLNRIF PGTHKHPDKP
DVVVNDFSEF WALVLACEVG PP