Gene Hhal_2363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2363 
Symbol 
ID4709230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2592968 
End bp2594083 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content72% 
IMG OID639856838 
Productglycosyl transferase, group 1 
Protein accessionYP_001003928 
Protein GI121999141 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.629772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAGTCC TGTACCTGTT CGACACCACC GACCGCGCCG AGAGCGAGAG CGTCATCGAG 
ATGGCCAGCC ACGGCGTGGT CCCGACCATC GTCTGCCAGC CGGATGCGCC CATGCGCGGC
CGCTTCGAGG CGGCCGGGCT GGAGGTCATC CCGGTCGCCA TGCGCAGCAA GGCGGACCGG
GCAGCCATCG CCGCGCTGCG CCGGCTGCGC CGGGAACGGA CCTTCGACCT GGTGCACGCC
TACTACAAGA TCGCCCTGAC CAACTACAAC CTGGCGGCGG TCGGGCTGCC CCGGGTGCCG
GTGGTGGCCT ACCGGGGGAT CATCGGCAAT CTGAGCTACT GGGACCCGTT CTCCTGGCTC
TCCTTCCTCG ACCCGCGCAT CGAGCGCATC GTCTGCGTCT GCGAGGCCAT CCGCCGGTAC
TTCCTGGACA AACCCTTCCT GCCCGGGACC CGGCTGTTCC GTCCGGAGCG GGTGGTGACC
ATCCACAAGG GCCACCGCCC GGCGTGGTAC CAGCAGCCCG ACGCCCGCCT GCCCGCCGAC
CTGGCCATCC CGCAGGGCGC ACCGGTCATC GGCTGTGTGG CCCGGATGAA GAAGCGCAAG
GGCATCGTCG AGCTGATCCG CGCCTTCGAG CAGATCCCCG CCGAGCACAA CGCCCACCTG
GTGCTGATCG GCCCCATCGA GTACCCCGCC ATCGAGCAGG CGGCGGCCCA CAGCCCGGCG
GCGGATCGCA TCCGCATCAC CGGCTACCGG GCCGATGCGC CCAAGATCGC CGGCGCCTTC
GACATCGCCA CCCTGCCCTC CCTGCGGCGC GAGGGCCTGC CGCGGGCGAT CATCGAGGCC
ATGGCGCAGG GCATCCCGGC GGTGGTCTCG GACTCCGGCG GCAACCCGGA GCTGGTCGAG
GACGGCGTCA GCGGCCGGGT CACCCCGGCG GGCGATGTGG ACGCGCTCGC CGCAGCCCTG
CGCGAACTGG TCGCCGACCC GGCGCTGCGG GGTCGTCTGG GGGCCGCCGC CCACGAGCGC
ATCGCCACCC GCTTGACCGT CGAGCGCACG GCGCGGGAGA CGCTGGCCCT TTACGCCGGC
GTGCTCGGCC AACGCTCGGG CGAGCCGGCG GCCTGA
 
Protein sequence
MRVLYLFDTT DRAESESVIE MASHGVVPTI VCQPDAPMRG RFEAAGLEVI PVAMRSKADR 
AAIAALRRLR RERTFDLVHA YYKIALTNYN LAAVGLPRVP VVAYRGIIGN LSYWDPFSWL
SFLDPRIERI VCVCEAIRRY FLDKPFLPGT RLFRPERVVT IHKGHRPAWY QQPDARLPAD
LAIPQGAPVI GCVARMKKRK GIVELIRAFE QIPAEHNAHL VLIGPIEYPA IEQAAAHSPA
ADRIRITGYR ADAPKIAGAF DIATLPSLRR EGLPRAIIEA MAQGIPAVVS DSGGNPELVE
DGVSGRVTPA GDVDALAAAL RELVADPALR GRLGAAAHER IATRLTVERT ARETLALYAG
VLGQRSGEPA A