Gene LGAS_0223 details


Gene Information

Locus tag: LGAS_0223
Symbol:
ID: 4438749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Lactobacillus gasseri ATCC 33323
Kingdom: Bacteria
Replicon accession: NC_008530
Strand:
Start bp: 258371
End bp: 259735
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 36%
IMG OID: 639672083
Product: HD superfamily phosphohydrolase
Protein accession: YP_814070
Protein GI: 116628898
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
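
The start and end coordinates, gene length, and protein length listed above agree with one another, and this can be checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(258371, 259735)
    print(length, "bp")                  # 1365 bp
    print(protein_length(length), "aa")  # 454 aa
```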


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.237797
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00000000000130835
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAAAAT TTCAAAGTAA AAAGCTAGAT CATGAAAAAG TATTGCGTGA TCCAGTCCAC 
AATTATATTC ATGTTAAAGA TAAAGTCATT CTAGATATTA TTAATTCAAA AGAATTTCAA
CGCTTGCGCC GTATTAAACA GTTAGGACCT GCTTCCTATG TTTTTCAAGG AGCAACACAT
ACCCGCTTTG AACATAACTT GGGTGTTTAT GAATTAACAC GCCGAATCTG CGATATTTTT
GAAGAAAAAT ATACTAGTAA AGAACCCGGC GATGGATTAT GGGATCCTAA CGAACGTCTT
TTAGCAGAAT GTGCTGCCTT GCTTCATGAT ATTGGTCATG GTCCATACTC TCATACCTTT
GAGCATCTCT TTGGTACTAA CCATGAAAAA ATGGGTCAAC AAATTATTAC CGATAAAAGT
ACTGAAGTAA ACCAAGCTTT AAGACAAGTT AGTCCCAATT TTCCAGAATT AGTAGCTAGC
GTAATTGCTA AGACTTACTC TAACCCACAA GTTGTAAAAT TAATTTCTAG TCAAGCAGAT
GCTGACAGAA TGGATTACTT ACTTCGTGAT GCTTACTTTA CTGGGGTAAC TTATGGTAGT
TTTGATTTAA CCAGAATCTT AGAAGTGATT CGTCCGTACC GTGATGGAAT TTGCTTTACT
GATAAGGGCA TCCACGCTGT TGAAGACTAT ATTATTAGCC GCTACCAGAT GTATCAACAA
GTTTATTTTC ACCGCGTAAA TCGTTCAATG GAAGTCATCC TGCATCACTT ACTTGAAAGA
GCGCAGATTA TATATGAAGC AGGTAAGCTC CAAGTTACTC CACAACTAGA GGCCTTTCTA
AAAGGAAATT GGACACTTGA AGATTATCTT AATTTAGACG ACGGCGTAAT GGAAACTAAT
TTCTTATTAT GGACTAATTC AGGCGATCAA ATTCTATCAG ACCTTTCAAG TCGTTACTTA
TATCGTCATC CCCTTGAAAG TGTCAAGATT AATGAAGATA CTAAGAGTTT ACTACCAAAA
TTAAAGAATT TAATTAAACA AGCAGGTTTT GATCCTAACT ACTACACTGC CACTAATTCG
GCATTTGATG AGCCATATGA TGCTTATAAG CCTATTGGTA AAAATGCCCA CAGTCCGATT
GAAATCATGC AAGCTGATGG TAGCTTAGTC GAACTATCCG AGTTAAGTCC CCTCGTTAAA
TCTTTAAATG GTACGCTTCA GGGAGATGAA CGTTTCTTCT TTCCTAAAGT AATGGTTAAA
GAAACTGATG AGCCACAAAT TTTTGATCCA ATTTATCAAG AATTTCAGAA ATATATTAGA
AACAACACAT TGCGTTACTT AAGACGTCCA AATAAAAAGA AATAA
 
Protein sequence
MEKFQSKKLD HEKVLRDPVH NYIHVKDKVI LDIINSKEFQ RLRRIKQLGP ASYVFQGATH 
TRFEHNLGVY ELTRRICDIF EEKYTSKEPG DGLWDPNERL LAECAALLHD IGHGPYSHTF
EHLFGTNHEK MGQQIITDKS TEVNQALRQV SPNFPELVAS VIAKTYSNPQ VVKLISSQAD
ADRMDYLLRD AYFTGVTYGS FDLTRILEVI RPYRDGICFT DKGIHAVEDY IISRYQMYQQ
VYFHRVNRSM EVILHHLLER AQIIYEAGKL QVTPQLEAFL KGNWTLEDYL NLDDGVMETN
FLLWTNSGDQ ILSDLSSRYL YRHPLESVKI NEDTKSLLPK LKNLIKQAGF DPNYYTATNS
AFDEPYDAYK PIGKNAHSPI EIMQADGSLV ELSELSPLVK SLNGTLQGDE RFFFPKVMVK
ETDEPQIFDP IYQEFQKYIR NNTLRYLRRP NKKK