Gene Hhal_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1066 
Symbol 
ID4709844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1157294 
End bp1158955 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content71% 
IMG OID639855537 
Producthypothetical protein 
Protein accessionYP_001002644 
Protein GI121997857 
COG category[R] General function prediction only 
COG ID[COG3044] Predicted ATPase of the ABC class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGC TGCAGACGCT CCTCCAGCGC CTCGACGGTC GAGGCTACAA GGCGTACAAG 
GACATCCAGG GCCGCCACGC GTTCCCTGGT TTCACCCTGT GCGTCGACAA GGTCCAGGCT
GACCCCTTCG CCGCTCCCTC GCGGATGCGG GCCGTCGTAC CGTGGCAACA GGTCGCCTTA
CCGGAGCCGG CCCTGGCCGG AGAGCCCCGG CGCCGCGCGG CGCGCGACTA TATCGCGCGG
GCCTTCGCGA ACGAAGCAGA CGGTGAGCAG GCGGTGCGCA TCGATGCCGG CGCGCAGACG
GTGCTCGATC GCAGCGCCGT GCTGTTCACC GATGCCGGGG TCGAGCTGCG CTTCACCGTT
GCCTTGCCGG CCCGGGGGCG GACCATCCTC GGGCGCCAGG CCGCCGAGAT CCTCTGCCGG
AAGCTGCCGC GGATCGTCGC CGGGGCTACC GAGGCGCAGC GGCTGGATGC CGCCGCGCTG
GAGCGCCACT GCGCGGCGGT GGAGGACCAG CACGCCCTGC GCGGACAGCT TGCCGAGCGT
GGTCTGGTCG CCTTTGTCGC CGATGGGGCC GTGCTGCCGC GGCGCTCCGG CGTGGACGAC
CGGCCCTTGG CGGACGCCAT CCCCTTCGAC AGCCCCGATT CGTTGCGGGT GACGCTGGAG
GCGCCGAACG CCGGCTCGAT CTCCGGCATG GGGATCCCGC GCGGGATCAC CCTGATTGTC
GGGGGTGGCT TCCACGGCAA GTCGACCCTG CTCAACGCCC TGGAACTCGG TGTCTACGAC
CACAGGCCGG GGGACGGGCG TGACGGTGTG GTCAGCCTTG AGGACGCGGC CAAGATCCGC
GCCGAGGATG GTCGGGCCGT GCACGCCGTG GATCTCTCCC CCTATATCGG CGAGCTCCCC
TTCGGCAAGT CCACGACCGC TTTTTCTACC GATCTGGCCT CCGGCTCCAC CAGCCAGGCG
GCGGCCCTGC AGGAGGCGCT GGAGGCGGGC TCGCGGACCC TGCTGGTCGA CGAGGACACC
TCGGCCACCA ACTTCATGAT CCGCGATCGG CGCATGCAGG CCCTGGTAGC CCGAGACGCC
GAGCCGATCA CCCCGCTGGT GGATCGCATC GCCGAGCTGC GTGACCGGTA CGGGGTCTCG
ACCCTGCTGG TGATGGGGGG CTCCGGCGAC TATCTGGATA GCGCCGATAC GGTGATCCAG
ATGCAGGACT ACCGCCCGGC GGACGTCACC GCCGAGGCGC GCCGCGTGGC GACCGAGTAC
GCCACCGGGC GCCACGCCGA AATCCGTTCG CCGCTGTCCT GGCCGGCCAT GCGGGCCGTG
GACGGGCGGA CGGTGCGCAC CGAAACCAAG CCCGGTAAGC GCAAGGTTCA GGGCCGGGGC
CGGGATACCC TCCTGGTGGG CCGCGAGGCC ATCGACCTGC GCGCCGTGGA GCAGATCGCC
GATGCGGCCC AGGTGCGGGC CATCGGCCTG CTGCTCGAGC GGCTCGCCGA CAGCGGCCGG
GTCGAGGATC CGCCGGCGTG GGTAGCCGAA CTGCTGACGC GCCACTGGGC ACAACTGCTG
CCCCGTCCCG ACGGCGATCT GGCCCGCCCG CGGACGATCG AGGTCATGGC TGCGCTCAAC
CGCCTGCGCG GTGTCCGTTT CAACGCCGAC GGCGTAGGCT AG
 
Protein sequence
MEQLQTLLQR LDGRGYKAYK DIQGRHAFPG FTLCVDKVQA DPFAAPSRMR AVVPWQQVAL 
PEPALAGEPR RRAARDYIAR AFANEADGEQ AVRIDAGAQT VLDRSAVLFT DAGVELRFTV
ALPARGRTIL GRQAAEILCR KLPRIVAGAT EAQRLDAAAL ERHCAAVEDQ HALRGQLAER
GLVAFVADGA VLPRRSGVDD RPLADAIPFD SPDSLRVTLE APNAGSISGM GIPRGITLIV
GGGFHGKSTL LNALELGVYD HRPGDGRDGV VSLEDAAKIR AEDGRAVHAV DLSPYIGELP
FGKSTTAFST DLASGSTSQA AALQEALEAG SRTLLVDEDT SATNFMIRDR RMQALVARDA
EPITPLVDRI AELRDRYGVS TLLVMGGSGD YLDSADTVIQ MQDYRPADVT AEARRVATEY
ATGRHAEIRS PLSWPAMRAV DGRTVRTETK PGKRKVQGRG RDTLLVGREA IDLRAVEQIA
DAAQVRAIGL LLERLADSGR VEDPPAWVAE LLTRHWAQLL PRPDGDLARP RTIEVMAALN
RLRGVRFNAD GVG