Gene Hhal_0303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0303 
Symbol 
ID4711213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp340346 
End bp341707 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content70% 
IMG OID639854763 
Producttetratricopeptide TPR_4 
Protein accessionYP_001001899 
Protein GI121997112 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACGGC TGGGTATGAT CAAACGCACC GGGCTCTGCG CCCTCTACCT TGCGCTCGGC 
CCCGGTCTGG CAACCGCGGA GCCCCCGTCC CAGCCTACAA AGGTTGAGAT CAATCCGGAC
CGGGTCGCCT TGGAGATCCG GCGGGAACTC GATGCCGACC GCCCCGGGCA GGCCCTCACC
CTGGCCCGAA CCCACAAGGA ACTCGCCGGA TACCCCATCT ACGACTTCGA GGCCGGGCGC
GCCTACCTGC GCAGCGGGGA TATCGACGAG GCGGTGGTGC ACTTCGACCG GGCCGTCATG
GTCGCCCCGG ATGTGCCGCG CTACCGTCTG GAGTACGCCC GCGCGCTGTT CGCGGCCGAG
GATCACGAGG CCTCTCAGCG TCAGTTTCAG CGCGTGCTGG ATACCGATGT ACCGGAGCCG
GTGGCCCAGA ACATCCGCCG CTTCCTGGAG GTCATTGACG CCCGGCTCGC CTCTCGCCGC
CCGGAGACCC GACTGGAGGT CGCCGCCGCC GCTGGCTACG ACTCCAACCC GCTCTCGGCG
GCGGACGACG AGTTCCTGCT CTTCGGCGTC TTCCCGCAGA CCTTCGAGCG CGAGTCCGAT
ACCTTCCTCG ACACCCGCGC GCAGCTGGAA CACCGCCGCC CCCGCACCCG GAGCAGCAGC
TACCACTACC GTGGCGAGGT GGAGCACCGC CGCCACAGCG ACGTCAGCGC CGCCGACCAG
ACCCAAGCGC GCCTGCGCGG CGGGTTGTCG TTCGAGGGGG CACAGGGCCG CTCCTACCGC
CTGCCGGTGG AGGTGCAACA CACCCGCCTG GACGGCGAGA CCTTCCGCAC CCGCGTCGCC
TTCCTGCCCC AGGCGGTGCT CCCCGGGGCC CCGGACCGCC AACTGCGCCT GCAGGGCCAG
CTGGCCTACG CCGATTACGA CAACGACGAC CGCGACGCCG TCACCCTCGG CGCCTCCGCC
ACCTCGCTGC ACGTCCTCAA CCCGGATAGC GGGCTCTTGC TCTACACCGG CCTGGCGGCC
TCCTATGAGG ACGCCGACGC CGACGCGTTC ACCACCACCC GCGCCGGCGC ATTCGTAGGC
GCCCAACGGG ACCTGTTCCA GGACGCCACC GCCAGCGTCA CGCTCACCGC CTCCCACGAG
CGGGCCCGCG AGGCGCGCGC GATCCTCGGC CTCTTCCCGG AGGAGGACAA CACCGCCGAG
CGGGCCACGA CGTTCGAGCT CCGCGGGGCC CTGGCCCACC CGCTGTGGGA CTCCGGATTC
ACCGGCTTCG CCGAGGGCGC CCTGCGCGAG AAGCGCTCCA ACATCGACCT GTTCGAGTTC
ACCCAGCGTG AGATCTTCGC CGGAGTGCGC TATGACTACT GA
 
Protein sequence
MGRLGMIKRT GLCALYLALG PGLATAEPPS QPTKVEINPD RVALEIRREL DADRPGQALT 
LARTHKELAG YPIYDFEAGR AYLRSGDIDE AVVHFDRAVM VAPDVPRYRL EYARALFAAE
DHEASQRQFQ RVLDTDVPEP VAQNIRRFLE VIDARLASRR PETRLEVAAA AGYDSNPLSA
ADDEFLLFGV FPQTFERESD TFLDTRAQLE HRRPRTRSSS YHYRGEVEHR RHSDVSAADQ
TQARLRGGLS FEGAQGRSYR LPVEVQHTRL DGETFRTRVA FLPQAVLPGA PDRQLRLQGQ
LAYADYDNDD RDAVTLGASA TSLHVLNPDS GLLLYTGLAA SYEDADADAF TTTRAGAFVG
AQRDLFQDAT ASVTLTASHE RAREARAILG LFPEEDNTAE RATTFELRGA LAHPLWDSGF
TGFAEGALRE KRSNIDLFEF TQREIFAGVR YDY