Gene Hhal_1509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1509 
Symbol 
ID4711531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1633254 
End bp1634531 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content69% 
IMG OID639855976 
Productglycosyl transferase, group 1 
Protein accessionYP_001003078 
Protein GI121998291 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGGCG GCCAAGTATC GGATCGCCGG CAAGGGCCGC GCGTGGTGGT CTTCACCCGC 
GTTTACCCCA ACGAGGCGCA GCCCACCTTC GGGGTCTTCG TGCGCGAGCG TATGACCCGG
GTGGCCGATT CCCTGCCGGT GGTTGTGGTG GCACCGGTAC CTTGGTTTCC GGGGGAGGGT
ATCGTCCGGC GCTGGAGGCC GTGGTATCGG CCCTCTGTGG CGTATTGCGA GGAACAAGCC
GGGACGACCG TCTACCACCC GCGTTTCCTC TGCCTGCCGG GTGTGCTGAA AAGCCTCGAC
GGTCTGTTCC AGGCCCTCGG GGCGTTGCCC ACCCTGCGGC GGCTGCGCCG AAGGGGTCAG
CTCGATTTGC TCGACGCGCA TTTCATCTAC CCGGACGGGG TGGCCGCCTG GCTCGCCGCT
TGGTGGCTGG GCTGCCCTTA CACGGTGACC CTGCGGGGGA CCATCGTGCG GATCTCGCGC
ACCCGGGTGC GCCGTTTCCT GGCGCGGCGA GCGCTACGCC ATGCCGCGCG GGTCTTCTCG
GTTTCGGAGT CGCTGCGCAG TGTGGCGCTG GGGATGGAGC CGCAGCGGTC CGTGGAGGTC
GTTCCGAACG GGATCAACCT GGCGCTATTC GGGCCTGAGG ATCGGATGGC CTGCCGGCGA
GCGCTCGGGA TCCCGGAGCG TGCCCAAGTG TTGATTACGG TCGGGACGCT GAACGAGCGC
AAGGGGTTTC ATCGGGTGAT CGAGCTGATG CCGTCCCTGG ACGAGCAGAT CGGCGACCTC
CACTACCTGG CTGTGGGAGG GGGGAGCCCC GACGGCAATG ACGAACAGCG CCTGCGGGAT
CTGGCAAAAG ATCTGGGGGT GGCGGAGCGT GTGCACTTTG CCGGTGCGGT GGCCTCGGAG
CGCCTTCGCT TTTACTACGC GGCGGCGGAC CTTTTCGTGC TCCCCACCCG TTTCGAAGGG
TGGGCGAATG TCTTCCTCGA GGCGGCCGCC TGCGGGCTGC CTACGGTGAC CACCGATGTC
GGGGGTAACG CCGAGGTGAT CGCTCACGAC CAGCTCGGGA CGCTGGTGCC GTTCGGGGAT
ACGGAGGCGC TGCGCCGGGC CATTGTTGAG GCCCTGGCGT ATGACTGGGA TCGGCAGGTG
ATCCTGGACT ACGCGCAGGC GAACGCCTGG CAGACCCGGA TCCCCTCCTT GGTGGACAAG
CTCGAGGCCG CCGCTGACGG CGCCGGGCAG GGTGCGCGTC GGCCTGCCGG CGGGGGCGGG
GAGGGTCCGT GCCGGTGA
 
Protein sequence
MTGGQVSDRR QGPRVVVFTR VYPNEAQPTF GVFVRERMTR VADSLPVVVV APVPWFPGEG 
IVRRWRPWYR PSVAYCEEQA GTTVYHPRFL CLPGVLKSLD GLFQALGALP TLRRLRRRGQ
LDLLDAHFIY PDGVAAWLAA WWLGCPYTVT LRGTIVRISR TRVRRFLARR ALRHAARVFS
VSESLRSVAL GMEPQRSVEV VPNGINLALF GPEDRMACRR ALGIPERAQV LITVGTLNER
KGFHRVIELM PSLDEQIGDL HYLAVGGGSP DGNDEQRLRD LAKDLGVAER VHFAGAVASE
RLRFYYAAAD LFVLPTRFEG WANVFLEAAA CGLPTVTTDV GGNAEVIAHD QLGTLVPFGD
TEALRRAIVE ALAYDWDRQV ILDYAQANAW QTRIPSLVDK LEAAADGAGQ GARRPAGGGG
EGPCR