Gene Hhal_0771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0771 
Symbol 
ID4710665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp855103 
End bp858171 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content68% 
IMG OID639855232 
Producthypothetical protein 
Protein accessionYP_001002351 
Protein GI121997564 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase
[COG1213] Predicted sugar nucleotidyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCCAC AGCTAACTGA CTATAGTACG CTTATAATCC TCGGCGCCGG CGCCCCCCAC 
CGTGGGGACC TTCCAGCCGC TCTGCGCGAG CCGCGCTCCG GCACTTCGGT CCTGCAGTGG
CTGCTGGACG CGTCCGGCTG CTCGCTCGAG TGCACTACCT TCGTCGCTGG GTACCAAGCA
GACGCAATTC GAGAGCGTTA TCCCGATTTG GCGGTGGTGG AAAACCGGGA TTGGGAGCAC
ACGGGGAGTG GCGCTTCACT CCTGGCGGCG CCTTTTGGTA CGAACTCCCC CCTGCTGGTC
TGCTACAGCG ACATCCTCTT CCGTCAGACA GTGCCCACGG CGCTGGCGCG CTGCGAGGCC
GATGTGGCAG TGGCCTGGGA CAGCGCCTGG GACCACCGCT ACGCCGGGCG CGAGGCTGCG
GACCTGGCCC GCTGCGAGAA GGTGATGGTC AACGGCGAGC GCATCGAGCG CCTCGGCGCC
GACCTGCCGG TGGACTGGGC CGACGGCGAG TTCATTGGCC TGGTGCGCTT CTCCCCGCGC
GCCCTGGAGT GGCTGGAGCG CCTGCGCGCG GACGGCCCCG AGAGCCTGCG CCAGCGCCAC
CTCTCCGAAT ACATCGAATA CCTGCGTGCG GCCGGCCTTT CGGTCGCCGG CGTCGATGTC
GCTGGAGACT GGGCCGAGTT CAACGAACCA CGGGACATCG CCCACTTCAT TCTCGGTACC
AAGGCCGAGA CATTGGGCCG GCTGCGCGGC ATGGTCCGCC ACGCGGTCAT TCAGGACCAA
GTCGTATGCA CCGTCGCCGA GTGGCAAGCC GATCGCGCGG CGGTGATGGA CCAGGTGCGC
CAGCGCTTCA CCGGAGAGCG GCTGGTTGTC CGCTCCAGCG CCCGTAGCGA GGACTCCTTC
CACCATTCGA ACGCTGGCGG CTACGACAGC CTGCTCAACG TCGATCCTCG GAATGGCCTT
GAAGAGGCCG TCGAGCAAGT CATTGCTTCC TATGGGACGG CGGATGACGA CGACCAGGTG
CTCATCCAGC CCATGGTCAC CGACGTGGCC ATGAGCGGCG TCGCCTTTAC CCGTACCCTG
GAGCACGGGG CACCGTGGTA TGTGGTCAAC TACGAGACCG CCGGCGACAC CGAAGCCATC
ACCAGCGGGG CTAGCGGTGA CCACCGCACG CTGCTGCTGC GCCGTGGCGC GGAACCCGAA
ACGCTGCCGG AGCCACGGCT GGGGCCGCTG GTGGCCGCAC TGCACGAGAT CGAGGCCCTC
CTCGGTTACG ACGCCCTGGA CGTGGAATTC GCGCTCGACC CCGCCGGCTC GGTGCACATC
CTCCAGGTCC GACCCATCGC GGTAGATCGC AAGGGTAGCG ACCTGGACGA TAGTGCCTTC
GACGCCGCCA TGGCTGCCGC CCACGCCCGA TGGGAGACGC TAACCCCGGC CCCACCCCAC
CTGCCCGGGG ATCCGGCGCC GCTCTATGGC GTGATGCCCG ACTGGAATCC GGCGGAGATC
ATCGGCACCG CTCCGGGCGC CCTGGCCGCC AGTCTCTACC GTCACCTGAT CATGGACGAG
ACCTGGGCCA TCCAGCGCGC CGAGTACGGT TACCGCGACG TGCGCCCGGC TCCACTGCTT
GTGGAGTTCG CTGGCCACCC GTATGTGGAT GTGCGTGCCA GCTTTGCCTC GTTCCTGCCG
GCGGAACTCC CCGACGATTT GGCCGGCCGG CTGCTGAGCT TCTACCTGGA GTGGCTGCGC
CAGCGCCCGG AGCTCCACGA CAAGGTGGAG TTCGAGGTGG TGCCCACCTG CCTGGCCCCC
GGCTTCGAGG GCTGGGAGCG GCGCCTGCAG GAGGACGGCG GCTTCAGTAC CGACGAGGTC
CGCGCCCTGC GCGAGGGGCT ACGCCGGATC ACCGCCGGCG CCTTCCACCG CTGCGAGGAC
GATCTGGCGC AGATCGAGAC CCTGCGCCAA CGCTTCCAGG CGATCGAGGG CGATACCCGG
CTGGCACCAC TGGAGCGCGC ACGGATCCTA CTGGACGACT GCCGGCGGCT TGGGACACTG
CCGTTCTCCC ATCTGGCTCG CAGCGGTTTC GTCGCCGTAA CACTGCTGCG CGAAGCCGAG
GCGTGCGGGA TGATCAGCGC AACCGCCCGG GAGAGCTTTC TCTCCACGGT CCGAACGGTG
AGCCACCGGC TCACGGCGGA CGCACGGGCC ACCGCCACCG GGGAGATGAG CGAGCACGCC
TTTATCGCTC GTTACGGCCA CCTGCGCCCG GGCACCTATG ACATCACCTC GCCCCGTTAC
GACGCCGATC CAGAGCGCTT CCTCCGGCCG CTGGTGGAGC ACGCCCGGGA GGCTGCCATG
GAGGAAGAAA ACCCTGGGCC CTGGCAAGCC GAGCGCGCAG CCTTCTTCTC GGCACTGGCC
GAGCTCGGCC TGCCGGCCGA CCCGGAGCGG GTCGAGACCT TCCTGCGTCA GGCCATCGAG
GGGCGGGAAT ACGCCAAGTT CATCTTCAGC CGCAACCTGT CCGCGGCGCT GGAGGCACTG
GCTGAGGCGG GAGCACAGTA CGGTCTGGAG CGCGCCCAGG TCGCGCACCT GCCGCTGGAC
GAGCTACTCG CCCTGCGCTC CGCGGCGCGT TCTGATGAGG CGATCGCCCG TCATCTGAGG
ATGCGCGCCG ATGAGGAGGC CGAGGCCCGG CGGGTGGCAG GGGCCTGCGA GCTACCGCCG
CTGATCACCG GCCAGGCGGA CCTGGATGCC TTCGTCATTG GTGCCGACCG ACCCAACTTC
ATCGGCTCCG GCTGCATCAC GGCGGACTGC CTGGACCTCG GCGATCAACC GGCCGATGCG
GACCTAGACG TGTCCGGGCG GATCGTCCTC ATCCCTCAGG CCGATCCCGG CTACGATTGG
CTCTTCGGCC AGGGGATCAC CGGGCTGGTG ACCCTCTACG GCGGCGCCAA CTCGCACATG
GCCATCCGCG CTGCGGAATT CGGTCTACCG GCCGCCATCG GCATTGGCGA GCAACGCTAC
CGTGAATTGG CCCAAGCGCG AGTTGTCGAG CTTGCCCCGG CCAACGGCAT CCTGCGGGTG
GTCCGATGA
 
Protein sequence
MVPQLTDYST LIILGAGAPH RGDLPAALRE PRSGTSVLQW LLDASGCSLE CTTFVAGYQA 
DAIRERYPDL AVVENRDWEH TGSGASLLAA PFGTNSPLLV CYSDILFRQT VPTALARCEA
DVAVAWDSAW DHRYAGREAA DLARCEKVMV NGERIERLGA DLPVDWADGE FIGLVRFSPR
ALEWLERLRA DGPESLRQRH LSEYIEYLRA AGLSVAGVDV AGDWAEFNEP RDIAHFILGT
KAETLGRLRG MVRHAVIQDQ VVCTVAEWQA DRAAVMDQVR QRFTGERLVV RSSARSEDSF
HHSNAGGYDS LLNVDPRNGL EEAVEQVIAS YGTADDDDQV LIQPMVTDVA MSGVAFTRTL
EHGAPWYVVN YETAGDTEAI TSGASGDHRT LLLRRGAEPE TLPEPRLGPL VAALHEIEAL
LGYDALDVEF ALDPAGSVHI LQVRPIAVDR KGSDLDDSAF DAAMAAAHAR WETLTPAPPH
LPGDPAPLYG VMPDWNPAEI IGTAPGALAA SLYRHLIMDE TWAIQRAEYG YRDVRPAPLL
VEFAGHPYVD VRASFASFLP AELPDDLAGR LLSFYLEWLR QRPELHDKVE FEVVPTCLAP
GFEGWERRLQ EDGGFSTDEV RALREGLRRI TAGAFHRCED DLAQIETLRQ RFQAIEGDTR
LAPLERARIL LDDCRRLGTL PFSHLARSGF VAVTLLREAE ACGMISATAR ESFLSTVRTV
SHRLTADARA TATGEMSEHA FIARYGHLRP GTYDITSPRY DADPERFLRP LVEHAREAAM
EEENPGPWQA ERAAFFSALA ELGLPADPER VETFLRQAIE GREYAKFIFS RNLSAALEAL
AEAGAQYGLE RAQVAHLPLD ELLALRSAAR SDEAIARHLR MRADEEAEAR RVAGACELPP
LITGQADLDA FVIGADRPNF IGSGCITADC LDLGDQPADA DLDVSGRIVL IPQADPGYDW
LFGQGITGLV TLYGGANSHM AIRAAEFGLP AAIGIGEQRY RELAQARVVE LAPANGILRV
VR