Gene Hhal_0389 details


Gene Information

Locus tag: Hhal_0389
Symbol:
ID: 4709750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 454072
End bp: 456075
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 64%
IMG OID: 639854852
Product: TonB-dependent receptor
Protein accession: YP_001001985
Protein GI: 121997198
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
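
The coordinate and length fields above are mutually consistent, which a few lines of Python can confirm. This is a minimal sketch: the variable names are illustrative, and it assumes the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 454072
end_bp = 456075

# Assumption: both coordinates are inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2004
print(protein_length_aa)  # 667
```

Both results match the Gene Length (2004 bp) and Protein Length (667 aa) fields of the record.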


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCCGA GGCGATCGTG TACAGCCCCC TTTTTCAGGT CCCTTACGCT TTTCAGCGGT 
GGTTTGGTGA CGCTCCTTCC CGTATCGTTG CCCGCCAAAG ATGCGGATGA CTTGCCTGTT
GTGCGTGTCG AGTCCAGCGT CGATTCGCTG GGGCGGGGCT CGAGCCGCGA AGAATTCCAG
AGGCGCCAGT CCTCGCGCAG CGGCGAGCTC TTTCGGGGTG ATGCGTCAGC CACCGTAGGG
GGCGGCAGTC GCAACGCCCA GCGCCTCTAC CTGCGCGGGG TTGAGTCAAA CAACCTCAAC
GTGACCGTCG ATGGTGCTCG GCAAGGTCGC GACCTCCACC AGCACCGTGG TGGCCTAACC
GGTCTCGATC CGCAGCTGCT CGATGACGTG GAAGTCGATA CGCGGCCTGC GGCGGATCAA
GGTCCCGGCG CGCTGGGTGG CAGCGTACGC TTCCGGACGG TCGACGCGCA GCAGCTCCTC
GATCCTGATG AGCAGACGGG TGCACGCTTG AGGGCCGGTT ACGCAACAGC CGATTCTTCC
GAGCAGGGAT CAGCCACTGT CTTTCAGCGG TTGGGTTTCG ATTGGGGCGC CCTCGCCCAT
GTGAGTGGGG CGAACCGGGA CGATTACGAG ACCGGGGGCG GTGACAACAT GCCGTACTCG
GGCGGAGGCG ATCGTAGCTA CCTGCTGCAG GCTAGCCGAA TGCCTGTGCA CGGGCATGAA
CTGCGCCTCG GTGTCCAGCG GCACAGCTTT GAGGGCGATA CGCTCTCCGG TGGGGCGGGC
AGTGATTTTG GCGATCCCCG GGTAGAGCAC CGAGGGGAGC CGGAAAAACA GGAGCTGCGA
CGGGACACGT GGACGGTGGA GCACCGTTAC GACCCCACCG ACCCCAACGT GGACTGGCAG
GCACGGGTTT ATCGCAACGA TAATCGGCTC AAACGTCTGG ATCAGGGCAC CGAGACGCGC
GCACTTGAGC ACGGCGGCGA TCTTCGCAAC ACCTTCTCCC TCAATGCCGG ACCGACGCGT
CACCAGCTCA CCGCGGGTTT CGACTACTAC ACCGAGGATG GTCGTCTCGA GCAGGACGAC
GGCCCGCGGC TGAGCTACAC GGACCGCAAC TTCGGTGCCT TCCTGCAGAA CCGCATGGAG
TGGGAACGAC TCCGCCTTTC GAGTGGCCTG CGTTTCGATG ACTACACCAG TGCCCTGGGA
GAGCGGAACC CCGAGGGTGA CGCTTTTTCC CCCAACCTGA GTGCAGAACT CGATCTTGCG
GCGGGCTGGG CGGTCTTCGG CGGTTACGGG GAGGCAGCCA GAGGTCCTGG CGGCACAATG
CCCATCGGCT GGGTGCAGTA CATCGAAGAG GGCAACGACG ATCAGTCTTT AAAGACCGAG
GAGTCCCGCC GCAGCGAGGG CGGCCTGCGG TATCAGGGCC GGGGTCTGGT CGCCTCCCGT
GATCGCCTGA ACCTCGAGGC GACGGTTTTC GAGACGCGCA TCGACAACAG CCTTGAACGC
GTTGGCGGTG GTCCCCCGCA CCAATCGGGT GTTCGTCTCG GGCAGCGCGA TGTCCGGATC
AGCGGCTATG AGCTGCGGGC CGCATGGGGG GTGAATGCCT ATGACACGCG GCTGTCGTTC
CTGAGCGCCG AGACGGAGGA CGACGATGGC GACCCGGCTG GGGTTAGCCG TCGTCTGGCC
GGAAGTGGTG GTGATCGTCT GGTCTGGGAT CACCGTTGGG CGGCTCATGA AACCCTGACC
CTGGGGTATA CGCTCACCTG GGTGGGGGAT GACACCGATG TACCTGACGA TGAGCCGGAG
CGCGACGGTT ATCACCTTCA CGACATCCAG GTCCAGTGGC AGCCGTGGGC GGATGAGCAG
GTCACGCTGG GGGTGGTCGT GAACAACCTC TTTGACGAAC AGTATGCCGA GCACACATCG
CTGGTGTCGG AGCAGGATGG TGAGCTGATT GTTCGCGATG AGCCGGGGCG GGATATCCGC
CTGGAAGCGG CGCTGCGTTT TTGA
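
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first and last lines of the sequence are used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCATCCGA GGCGATCGTG TACAGCCCCC TTTTTCAGGT CCCTTACGCT TTTCAGCGGT"
last_line = "CTGGAAGCGG CGCTGCGTTT TTGA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))               # 60
print(first.startswith("ATG"))  # True: ATG start codon
print(last.endswith("TGA"))     # True: TGA stop codon
print(round(gc_content(first), 1))
```

Passing the full sequence block to `clean_sequence` and then to `gc_content` should give a value comparable to the GC content field of the record, assuming that field was computed over the same coding sequence.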
 
Protein sequence
MHPRRSCTAP FFRSLTLFSG GLVTLLPVSL PAKDADDLPV VRVESSVDSL GRGSSREEFQ 
RRQSSRSGEL FRGDASATVG GGSRNAQRLY LRGVESNNLN VTVDGARQGR DLHQHRGGLT
GLDPQLLDDV EVDTRPAADQ GPGALGGSVR FRTVDAQQLL DPDEQTGARL RAGYATADSS
EQGSATVFQR LGFDWGALAH VSGANRDDYE TGGGDNMPYS GGGDRSYLLQ ASRMPVHGHE
LRLGVQRHSF EGDTLSGGAG SDFGDPRVEH RGEPEKQELR RDTWTVEHRY DPTDPNVDWQ
ARVYRNDNRL KRLDQGTETR ALEHGGDLRN TFSLNAGPTR HQLTAGFDYY TEDGRLEQDD
GPRLSYTDRN FGAFLQNRME WERLRLSSGL RFDDYTSALG ERNPEGDAFS PNLSAELDLA
AGWAVFGGYG EAARGPGGTM PIGWVQYIEE GNDDQSLKTE ESRRSEGGLR YQGRGLVASR
DRLNLEATVF ETRIDNSLER VGGGPPHQSG VRLGQRDVRI SGYELRAAWG VNAYDTRLSF
LSAETEDDDG DPAGVSRRLA GSGGDRLVWD HRWAAHETLT LGYTLTWVGD DTDVPDDEPE
RDGYHLHDIQ VQWQPWADEQ VTLGVVVNNL FDEQYAEHTS LVSEQDGELI VRDEPGRDIR
LEAALRF