Gene Hhal_1906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1906 
Symbol 
ID4710807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2099951 
End bp2102032 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content72% 
IMG OID639856379 
ProductTonB-dependent receptor 
Protein accessionYP_001003472 
Protein GI121998685 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCC ATGAACCAAC GGCTGTGCGC CTGTTAGGCG CCCTGCTGGC TGCGGCGGTC 
TCGGTACCGG CCGGGGCCGC GGAGGAGGGC GATGAGCACG CGTCCTCCAC CAGCCTCTCG
CCGGTGCGCA TCGAGGCCGG CGGCGATCCC CTGGGCCGCG GCGTGGCCAG CGAAGCCCTG
CAGCGCCGCC AGGCCTCGAG CAGCGCCGAG ATCTTCCGCG GCGAGGCCTC GGCCGGGGTC
GGCGGTGGCA GCCGCAACGC GCAGCGGCTC TACCTGCGCG GTGTCGAGTC CAACAACCTC
AACGTCACCG TGGACGGGGC CCGCCAGGGG CGGGACCTCC ACCAGCACCG CGGCGGCCTC
ACCGGTCTGG ACCCGGATCT GCTGCGGGCG GCCGATCTCG ACCCGCGTCC GGCGGCGGAC
CAGGGCCCCG GTGCGCTGGG CGGTTCGGTG CGCTTCGAGA CGGTGGACGC CCAGGATCTG
CTCGACCCCG ACGAAGAGAC TGGGGCCCGC CTGCGCGCCG GCTACGCCAG CGCCGACGAG
GCCGAGCGCG GCTCGGCCAC CGCCTTCGGC CGGCTGGGCG GCGACTGGGG CGCCCTGGCC
CATATCGGTG CGGTCAACCG GGACGACTAC CGGGTCGGCG GCGGCGACAC CATGCCGTAC
TCCGGCGGGC GCGATCGCGA CTACCTGGCG CGCATCAGCC GCGTGCCGGC ATCGGGCCAT
CAGCTGCGTC TCGGCGTGCA GCGCAACACC TTCGAGGGGG ATCACCATTA CGGCTCCTGG
GGCAGTGACT TCGGTGATCC GAGCGAGACG ACACGCCAAG ACCCGGTGGG CCAGGAGCAG
CGCCGCGACA CCTGGACCGC CGAGCACCGC TACCGCCCGG CCGACCCCCA CGTGGACTGG
CAGGCCCGGG TCTACCGCAA CGACAACCGC CTGGAACGCC AGGACGACAA CACCACCACC
CGCGCCGTCG AGCAAGGCGG CGACCTGCGC AACACCTTCA CCCTGGACGC CGGGCCGACC
CGTCACCGGC TCACCGCCGG TTTCGACTAC TACACCGAGG ACGGGCGCAG CGACCCCCAC
GGAGGCGGCT CGAGGCTCAG TCACCAGTCG CGCAACTTCG GCGCCTTCGT GCAGAACCGC
ATGGCCTGGG AGCGGTTGCG CCTCTCCGCC GGGCTGCGTT ACGACGACTA TGTCACCGAC
CTGCAGGAGG AGACCCTCCA AGGCGATGCC GTCTCGCCCA ACTTCAGTGC GGAGTACGAC
CTGACTGCCG GGTGGACCGC CTTCGCCGGC TACGGCGAGG CGGTCAGTGG CGCCGGGATC
CTGCCGATCG GCTGGCTGGC CTACATCGAC GACGAAGAGA CCAATCTGAA TGACGGCGAG
CCGTTCGAGG CCGAGGAGTC CCGCCGGCGC GAGGGCGGCC TGCGCTACCA GGGGCGGGAT
CTGATCACGG CCCGGGACCG CTTCGACTTC GAGGCGACGC TCTTCGAGAC GCGGATCAAA
AACAGTGTTG AGCGGGATGA CCCGTGGGGC ACACCGCACC AGCACAACCT GCCGCCCGAT
AGGCGACATG ACGCGTTCTG GGATGAGGAC GCCCCGCTGG TTGGGGGCGT CCGCAACCGC
CCCGATCCGG TCCGCCTGCG CGGCTACGAA CTGCGCGCCG CCTGGGGCGT GGGCCCCTAC
GACGCCCGGC TTTCGTTCCT CAGCGCCGAG GCCGTGGACG ACGACGGCGA CCCGGTGGGG
GTGATCCGGC GCCTGGGTGG GGGCGGCGGT GACCGTCTGG TCTTCGATCA GCGCTGGGCG
GCCCACGAGA CCCTGACCCT GGGCTACACG CTCACCTGGG TGGGGGATCA CACCGACGTC
CCCGACGACG AGCCGGAGCG CGACGGCTAC CAACTCCACG ACGTGCAGGC CGAGTGGCAG
CCGTGGGCCG ACGACCGCCT GACCCTGGCG CTGGCGGTGA ACAACCTCTT CGACGAGCAG
TACGCCGAGC ACACCTCCCT GGCGGTGGAG GAGAACGACG AGTGGCAGAT TCGCGACGAG
CCCGGCCGGG ACGTTCGGGT GACCGGCACC CTGCGCTTTT GA
 
Protein sequence
MNTHEPTAVR LLGALLAAAV SVPAGAAEEG DEHASSTSLS PVRIEAGGDP LGRGVASEAL 
QRRQASSSAE IFRGEASAGV GGGSRNAQRL YLRGVESNNL NVTVDGARQG RDLHQHRGGL
TGLDPDLLRA ADLDPRPAAD QGPGALGGSV RFETVDAQDL LDPDEETGAR LRAGYASADE
AERGSATAFG RLGGDWGALA HIGAVNRDDY RVGGGDTMPY SGGRDRDYLA RISRVPASGH
QLRLGVQRNT FEGDHHYGSW GSDFGDPSET TRQDPVGQEQ RRDTWTAEHR YRPADPHVDW
QARVYRNDNR LERQDDNTTT RAVEQGGDLR NTFTLDAGPT RHRLTAGFDY YTEDGRSDPH
GGGSRLSHQS RNFGAFVQNR MAWERLRLSA GLRYDDYVTD LQEETLQGDA VSPNFSAEYD
LTAGWTAFAG YGEAVSGAGI LPIGWLAYID DEETNLNDGE PFEAEESRRR EGGLRYQGRD
LITARDRFDF EATLFETRIK NSVERDDPWG TPHQHNLPPD RRHDAFWDED APLVGGVRNR
PDPVRLRGYE LRAAWGVGPY DARLSFLSAE AVDDDGDPVG VIRRLGGGGG DRLVFDQRWA
AHETLTLGYT LTWVGDHTDV PDDEPERDGY QLHDVQAEWQ PWADDRLTLA LAVNNLFDEQ
YAEHTSLAVE ENDEWQIRDE PGRDVRVTGT LRF