Gene Hhal_1573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1573 
Symbol 
ID4709833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1712998 
End bp1714083 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content63% 
IMG OID639856037 
Productglycosyl transferase, group 1 
Protein accessionYP_001003139 
Protein GI121998352 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATCC TGTTTGTCTC CACGCTCAGT CACATCCCAC AGCGAGCGAG CGGGGCCAAT 
AGCTCAACCG ATCAGCTCTG CCGCGGACTC TCATCGAGGG GGCACCACGT GGCTGTCCTA
TCGAGTCTGA GAGGTGGAGA CCGACTCGCT TGGTGGAACC GCCTACGGCA CCACCTGACA
CCGGGGCAGA ACCTCCCACC GGACCGGGCA TGCGGCTACC TGACCTACCG CGGTTATGCT
CCCCTGAAAG ATCTCGATGA GGCCTTGGGT GACTTCCGGC CCGACATTGC CGTTGCTGAT
GTCGGAAAGA CCAGCCGTAT CGCTGAAGCG CTACGGGCGC GCAGAGTCCC CACAGTGGCC
TATCTACGCG ACCTGGAACC GGAGACGTTC TCGGATCACC CCGTCACTGA TCCACAGGTG
AGCTTCATCG CCAATTCTCA GTTCACTGCC TCAGCCTATG CACAGCACCG CGACCTCCAG
TGTCAGATTA TCCGCCCGCT GGTTCACCCG GAGTATTACC GCGTGAATTC AACGCGCGAA
GTCGCACTCC ACATCAACCC CTCCCCGAAG AAAGGCATCG ACATCACGCT GAATCTGGCC
GAGGCTCGCC TGGACATCCC GTTCCTGCTG GTCGAAACCT GGAGTCTGGC GCGCGAGCTG
CGCGAGCACT ACCGCCGCCG TGCCGCCGAA TTGCCCAACG TGCGTGTCGT CCCGACTCAA
AGGGATATGC GAGCCTATTA CGGGCGCACC CGGGTCGTCC TGGCGCCGAG TATCTGGCCG
GAGACTTGGG GTCGCATTGC TACAGAGGCT CATATCAGCG GCATCCCGGT CTTGGCCAGC
ACACGCGGCG GCCTGCCCGA GTCGGTGGGT CCGGGCGGAA CGCTGCTGGA TCCCGAAGCG
CCTTTGGAGC AGTGGGTCGA GGCCCTCGGC AGGCTTTGGG ACGATAGGAC GGTCTACGCA
AGGCTAAGCC GTGCCGCCCT GGACCACGCC GACCGGCCGG AGATTCACCC CAAACGGGTT
TTGGATGACT TTTCAGGCGT TCTCGAACAA ACCCGCGCGA TGGCCACAGC GTCGACTCCC
GCCTAA
 
Protein sequence
MRILFVSTLS HIPQRASGAN SSTDQLCRGL SSRGHHVAVL SSLRGGDRLA WWNRLRHHLT 
PGQNLPPDRA CGYLTYRGYA PLKDLDEALG DFRPDIAVAD VGKTSRIAEA LRARRVPTVA
YLRDLEPETF SDHPVTDPQV SFIANSQFTA SAYAQHRDLQ CQIIRPLVHP EYYRVNSTRE
VALHINPSPK KGIDITLNLA EARLDIPFLL VETWSLAREL REHYRRRAAE LPNVRVVPTQ
RDMRAYYGRT RVVLAPSIWP ETWGRIATEA HISGIPVLAS TRGGLPESVG PGGTLLDPEA
PLEQWVEALG RLWDDRTVYA RLSRAALDHA DRPEIHPKRV LDDFSGVLEQ TRAMATASTP
A