Gene Hhal_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1121 
Symbol 
ID4710081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1217408 
End bp1219195 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content66% 
IMG OID639855593 
Productglycoside hydrolase 15-related 
Protein accessionYP_001002699 
Protein GI121997912 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACGC TCGACCAAGC CCTGATTGGC AACTGCGCGT TTGCCGCCCT GGTCAACCGC 
CAGGCCGAGA TCACTTGGGC GTGCATGCCC CGCTTCGATG GCGATCCGGT CTTCTGCTCC
CTGCTCGGCG ATCCCGCGGC CGGTGCCGGC GCCGGGCGTT TCGCCGTCGA GCTGGAGGGG
CTGGCCCGCA CCGAGCAGTG GTATGTTCGC AATACCGCCA TCGTGGTGAC GCGGCTCTAC
GACCACCAGG GCGGGGTGGT GGAGGTGACC GATTTCGCCC CCCGCTTCGA GCAGTTCGGC
CGCACCTTTC GCCCGGTGGA GCTGATCCGT CAGGTCCGTC GGGTTGCCGG CTCACCCCTG
GTCTCTCTGG TGGTGCGGCC GGTGTTCGAT CACGGCCGGT CGTTGCCCAG CGTGACTCAG
GGCAGCAACC ACGTGCGCTA CGTCGGCCCG GGCCAGGTCC TGCGCCTGAC CACCGACGCC
TCGTTGACCG CGGTCCTGGA CGAGACCCCG TTCATCCTCG AGGACGACCT GACGCTGGCC
TTCGGTCCCG ACGAGACCCT GCTCCAGTCG GCCCGTGAGA CCGGCTACCA GTTCTATGAG
CACACCCTAA CCTATTGGCA GGAGTGGGTG CGCAACCTGG CCATCCCCTT TGAGTGGCAG
GAGGCGGTGA TCCGCGCGGC GATCACCCTC AAGCTCAACA CCTACGAGGA CACCGGTGCG
GTGATCGCCG CGGTGACCAC TTCGATCCCC GAGGCCCCGG ACACCGGCCG CAACTGGGAT
TACCGTTACT GTTGGCTGCG GGACGCCTAC TTCGTGATCA ACGCCCTCAA CCGCCTGGGG
GCGACCAAGA CGATGGAGCA CTACGTGCGC TTCATCATCA ACACCACTGC CCGCAACGAG
GGGCGGTCGC TGCGGCCGGT GTATCGCATC AACGGGCGTG ATGACCTTTA CGAGACCATC
GCCTACGGCT TGCCCGGATA CCGGGAGATG GGGCCGGTGC GCATCGGCAA CCAGGCCTAC
GAGCAGCAGC AGAACGATGT TTACGGGTCA GTGATCCTGG CCACCGCGCA CTTGTTCTTC
GACGAGCGGC TGCGCCGCCG CGGCGATGAA TCGCTGTTCC GCCGTCTCGA GCAGCTCGGT
GAGCAGGCCG TCGCCGTCTA CCGCGAGCCG GATGCTGGCC CCTGGGAGTT CCGCGGCTTT
GAAAAGGTGC ATACCTTCTC GGCGGCGATC TGTTGGGCGG CTGCCCGGAA CCTGCGTGCC
ATCGCCGCCA AGCTGGGGTT GATGGAGCGG GCTGATTACT GGCGCCGCCG GGCCGACGAG
ATGGCCGACA CCATCCGCAA CAGCGCCTGG AACGAGCAAC GCAACAGCTA CATGGCCAGT
TTCGGCGGCC AGGACCTCGA CGCCAGCCTG ATGCTGCTCT ACGAGTGGGG GTTCCTGCGT
GCCGGAGACC CGCGCCTGGC CGGGACCGTG CGCGCCGTCG AGCAGGAGCT GCGCCACGGC
GACTTCCTCT TCCGTTACGT CCACGAGGAC GATTTCGGCA AGCCCCACAC GGCCTTCACC
ACCTGTACCT TCTGGTACAT CGACGCCCTG GCGGCGGTGG GCCGTGAGGC CGAGGCCCGC
GCACTGTTCG AGCGCCTGCT CGAATGTCGC AACCACGTGG GTCTGCTCTC GGAGGATATC
GATCCGTACA CCGGAGAACT CTGGGGGAAC TTCCCGCAGA CCTACAGCAT GGTCGGTCTG
ATCAACTCCG CAATGCGCCT GAGTCGCAGT TGGGAGGAAC CGCTGTGA
 
Protein sequence
MSTLDQALIG NCAFAALVNR QAEITWACMP RFDGDPVFCS LLGDPAAGAG AGRFAVELEG 
LARTEQWYVR NTAIVVTRLY DHQGGVVEVT DFAPRFEQFG RTFRPVELIR QVRRVAGSPL
VSLVVRPVFD HGRSLPSVTQ GSNHVRYVGP GQVLRLTTDA SLTAVLDETP FILEDDLTLA
FGPDETLLQS ARETGYQFYE HTLTYWQEWV RNLAIPFEWQ EAVIRAAITL KLNTYEDTGA
VIAAVTTSIP EAPDTGRNWD YRYCWLRDAY FVINALNRLG ATKTMEHYVR FIINTTARNE
GRSLRPVYRI NGRDDLYETI AYGLPGYREM GPVRIGNQAY EQQQNDVYGS VILATAHLFF
DERLRRRGDE SLFRRLEQLG EQAVAVYREP DAGPWEFRGF EKVHTFSAAI CWAAARNLRA
IAAKLGLMER ADYWRRRADE MADTIRNSAW NEQRNSYMAS FGGQDLDASL MLLYEWGFLR
AGDPRLAGTV RAVEQELRHG DFLFRYVHED DFGKPHTAFT TCTFWYIDAL AAVGREAEAR
ALFERLLECR NHVGLLSEDI DPYTGELWGN FPQTYSMVGL INSAMRLSRS WEEPL