Gene Hhal_0387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0387 
Symbol 
ID4711457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp451339 
End bp453390 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content67% 
IMG OID639854850 
ProductTonB-dependent receptor 
Protein accessionYP_001001983 
Protein GI121997196 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.84983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCCGA GACAAATCCG CACCATCCGG CCGCTGTTCA CGTACTCCAG CCTGGTCGCG 
CTCGTTACAC TCAGCGCGGC GGTCAAAGCC GATGACCCCC CGGCGGAACT GCCGGAGGTC
TTGGTCGAAC CCCCCGAGGA GACGGAAGGC AGCGGCACCG AGCGGAGCGC ACGCCAGCAA
TCCCATCGCC TGCACGACCT CTTCCGTGGG GATTCCGAGA CCCACGTCAG CGGCCCGCGG
CAGGCCCAGC GCCTCTATCT GCGCGGCATC GAGGGCAGTC AAACCCAGAT CACCATCGAC
GGCGCCCGCC AGGGCCGCGA CCTGCACAAC CACCGCGGCG GCCTGAGCGG GATCGACCCC
GCCTTCCTGC GCCGGGCTGA TGCCGAACCC GGTCCGCCGG CGGCCGATGA CGGCCACGGG
GCCACCGGAG GCTCCGTGCG CTTCGAGACC ATCGACGCCG GCGACCTGGT CGATCCCGAG
ACCGGCTACG GGGGCTTCGC CCGCGGCGTT CGGGGCAGCG CCGCCGATTC CCTGACCACC
AGCGCCGCGG GCGCCATCCA ACCCACGGAT CGCGTCGGCC TGCTGGTCGG GGGCAGCTAC
ACTTCATTGG ACGATTACCG CGTCGGCGGC GGGGATATCA AGGAGTACAC CGGCTACGAC
GACCGCAACC TGTTGCTGCG GCTCAACGCC GACGACGGCC ACAACCAACG GGTCCGCCTC
GGCTACGAGG AAAACGAGAA CCGGGGGGAA CTGCCAATGA ATGCCGGCGA CCGGGTCCGG
GGCGCCGATG GGCACATCCG CGAGGACGAC ATCGCCGACC AGCGCATGGT GCGGGAGACC
ACCAGCCTCA ACTACGAGTA CCACCCGGAA ACGCGGTGGG TCGGCCTGGA GTTCGACCTC
TATCGCAACA AGAGCGAATG GGAGAATCGC GACGACGATA CCGGTTTCCT CAGCGACGGC
GTCGGCGGCC GCCTGGCCAA CACCGCCACC CTGGCACGGG GCTCACTCGG TCCCCTGGGC
CACAGTGAGA ACCGCCTGAC GGTCGGCGGT GACCTGTACC AGGACACGGG TGAGGCGGAC
CACGGAGACA TCCTGACCTA CGACGCTCAG GGGCTGTTCG TGCAGAACCG GCTGGAGAGC
GAGCGACTGG ATCTATCCTT CGGGTTGCGC GGCGACTGGA TGGAGACCGA CTACGAGCAA
CCCGGCGAGT CGGTGGATTT CTCGGAACTC TCCACCAACG CCCGCATCGG CTACTGGGCC
ACCCCCTCCA CGGAGATCTT CGCCGGCTAC GGCGAGTCCG CCCAGGGCCG CTCCGAGACG
GTCGCACTGC ATTTGGACCG CAACATCGAC ACCGAAACGC GGATCGACTA CGACGAGCCG
CAGACCAGCA CCACGGCGGA GGCCGGCATC CAATCCGAGC AACCGCTGGC CGGAGGCTAT
CTGGAGCTGT CGGGCACCCT GTTCCGGACC GACATCGACG ATCTGATCCT CTACGAGTAC
GAGCGGCCGA CGAACCTGGG ACGACAAACG CCTCAGAGCG TCTACAACCT CGACGAACAG
ATCACCACGG AGGGCTATAC CCTCAAGGCA GCGTGGCGGG GGGAGGACCT CTACAGCGCA
TTGAGCTTCA CCCACGACAA GGTCCGCGGC CTCGACAGTG GCAATGCGCT AGGTACCAGC
CGCCCCGATG CACGTCAGCA GCTGGTTCGG ACGGTCGGCC CGCAGGGCGA CCGCCTCGTC
TGGGACAACG TCTACCAGCT CCACCCCGCC TTCCAAGTGG GCTACACCCT GAAGATGGTC
GCCGATCTGG AGCGGGTGAT CCCCGGGGAT GGAGAACGTG ACGGCTACAA CATCCACGAC
GTGCAAATGC GGTGGCAGCC CCCGGGCGAG ACGGATGTCA CCGTTTACTT CGTAGTCCAC
AACCTGTTCG ATGAAGAGTA CGCCGGCCAT ACGGCGATAC CGCAGTACGA AGCCGGAGAG
ACCGTGGCGG ACAGTGACTA CCTGCGGGAG CCGGGACGGG ACATGCGCCT CGGCGCCAAG
GTGCAGTTCT AG
 
Protein sequence
MHPRQIRTIR PLFTYSSLVA LVTLSAAVKA DDPPAELPEV LVEPPEETEG SGTERSARQQ 
SHRLHDLFRG DSETHVSGPR QAQRLYLRGI EGSQTQITID GARQGRDLHN HRGGLSGIDP
AFLRRADAEP GPPAADDGHG ATGGSVRFET IDAGDLVDPE TGYGGFARGV RGSAADSLTT
SAAGAIQPTD RVGLLVGGSY TSLDDYRVGG GDIKEYTGYD DRNLLLRLNA DDGHNQRVRL
GYEENENRGE LPMNAGDRVR GADGHIREDD IADQRMVRET TSLNYEYHPE TRWVGLEFDL
YRNKSEWENR DDDTGFLSDG VGGRLANTAT LARGSLGPLG HSENRLTVGG DLYQDTGEAD
HGDILTYDAQ GLFVQNRLES ERLDLSFGLR GDWMETDYEQ PGESVDFSEL STNARIGYWA
TPSTEIFAGY GESAQGRSET VALHLDRNID TETRIDYDEP QTSTTAEAGI QSEQPLAGGY
LELSGTLFRT DIDDLILYEY ERPTNLGRQT PQSVYNLDEQ ITTEGYTLKA AWRGEDLYSA
LSFTHDKVRG LDSGNALGTS RPDARQQLVR TVGPQGDRLV WDNVYQLHPA FQVGYTLKMV
ADLERVIPGD GERDGYNIHD VQMRWQPPGE TDVTVYFVVH NLFDEEYAGH TAIPQYEAGE
TVADSDYLRE PGRDMRLGAK VQF