Gene Hhal_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1061 
Symbol 
ID4709827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1149272 
End bp1151347 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content71% 
IMG OID639855532 
ProductTonB-dependent receptor, plug 
Protein accessionYP_001002639 
Protein GI121997852 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGTTAT CGCTGGCCCT CGGTCCCACC GGTCCGGCGC TCGGCGCGGA GGCGGCGCTG 
CCGCCGGTGG TCGCCACCGT GCCGCGGCTC GACGTGGACC CCGACGACTA CCCGGCGGCC
GTCTCCGTCG TCGGCCGCGA GGCCTACGCC AGCGGCCGGC GCCTCTCCCT GGATCAGGGC
CTCGATCGGG TCCCCGGCGT GCATACGCAG AACCGCTACA ACGCCGCCCA GGACCTGCGC
CTGTCGGTGC GCGGGTTCGG GGCCCGCTCA CCCTTTGGGG TGCGCGGCAT ACGGGCCTTG
GTCGACGACG TGCCGTACAC CGTGGCCGAC GGCCAGAGCC AGCTCGACGC CCTGGACCCG
GCCTTTGTGG AGCAGCTGGA GGTCATCCGC GGGCCGGCCT CCGCCCTCTA CGGCAACGCC
GCCGGCGGTG TCTTCCGCTT CACCACCCTG GAACCCCCGG AGAGCGGCGA GTTCCTCGGC
GGCGAGGTGC TCTTCGGCAG CCACGGCGAG CGGGCCCAGC GGGTCTGGGG CGGTCGTGCC
GAGGAGGCCT GGCGCACGGC GGTGACCGCC TCCAACCAGG AGATGGACGG CTACCGCGAC
CATTCGGGCG CGGCGCAGCG CCGACTGAAT ATCAAGCTCG ACCGCGATCT GGCCGGCGGC
GAGCGGCTGC GCTTTATCGC CAACCTCCTC GATGCGCCGG AGGCTCAAGA CCCCGGCGGG
CTGAGCCGCG AACAGGTGCG GGACGATCGT CGGCAAGCGA CGGACAACGC CGTGGACCGT
GACGCCGGCG AAGAGGTGGA GCAGCAGACC GGTGCCGTGA TCCTCACCGC CGAGCCGTCG
GCCACCGAGT TTTGGGAGGG GCGGGCCTTC CTGCAGCGCC GGGACTTCTA CCGGCGGAAC
CCGTTCCCGG GTGGTAATGG TGACCCAGGC GGCATTGTCA CCTTTGAGCG CTACTACGGC
GGCGTCGGCG GCCGTCACGT GCGGCGTACC GAGGCCGCCG GGCGCCCGGT GCAGGTGTCG
GTGGGCACGG ACGCCGAGTG GCAGTACGAC GACCGGCAAC GGTACGTGAA CAACAACGGT
CGTGAGGGTG AGCGGACCAA CGATCAGCGG GAGCGGGCCC AGGTGGTGGC CGGCTACCTG
CAGGCGGACT GGGAGGTGGC GCCGCGCTGG CGGCTGGTCG GTGGCGGGCG TCTGGACTGG
ACTCGGCTGA GCATCGATGA TCGCGAGGCG GGGTTGGATA CCGACAGTCA GACCTACACC
GAGCCCAGCT ACCTCGCCGG GGTGCGCTAC GCCGTGGCCG AGGACCACAG CCTCTACGCC
AAGGCCAGCT CCGCCTTCGA GACGCCGACG CTCTGGGAGC TGTGGGACCG GGACGCCGGG
GGCATCGACG ACACCATCGA GCCGCAGCAG GCGCGCTCCG TGGAGGTCGG CGTCAAGGGG
CGGGCCCTGG ATCGGCGGCT GCGCTACGAG CTGACCCTCT TCCAGGTGCG GACGGAGGAT
GAGCTGGTTC CGCAGGAGGA TGCAGACGGC CCGACCCGGT ATGCGAACGC CGGCGAGACT
CGCCGGCGTG GCGTGGAGTT GGGGGTGGAG GCCTTCCCCA CGGAGCGCCT GGAGGTGACG
GCCGCCCTGG CCTACGGGCA ATTTACCTTC CGGGATTTCG AGACGTCCGA GTTGGAGGGC
GTATCCGGCG CCGAGGACCC CGATCAGGTT CGTGGCAACC GCATCCCCGG TGTGCCGCGG
GCCCACGGCT ACCTGGAGAC GGCCTGGCAG GCCCCCAGCG GCTGGCGCTG GGCCGCCGAT
GTGCGCGCCA GCGAGGGCAT CTGGGCCGAC GACCGCAACA CCCAGCGCAG CTCCGGCTAT
ACGGAGGTCG GCCTCCACGC CAGCCGGACC TTCGTCACCG ACGCCTACGA GGCCGAGCCC
TTCTTCGGGG TGAACAACCT CTTCGACGCC ACCTACGACG CCAACGTGCG CATCAACGCG
GCCAACGAAT CGGATAACAG TCTGGAAGAC GGTGGCTACT TCGAGCCGGC CCCGGAGCGG
ACCATCTACG CCGGTATCCG CTTCGCGACC TACTGA
 
Protein sequence
MGLSLALGPT GPALGAEAAL PPVVATVPRL DVDPDDYPAA VSVVGREAYA SGRRLSLDQG 
LDRVPGVHTQ NRYNAAQDLR LSVRGFGARS PFGVRGIRAL VDDVPYTVAD GQSQLDALDP
AFVEQLEVIR GPASALYGNA AGGVFRFTTL EPPESGEFLG GEVLFGSHGE RAQRVWGGRA
EEAWRTAVTA SNQEMDGYRD HSGAAQRRLN IKLDRDLAGG ERLRFIANLL DAPEAQDPGG
LSREQVRDDR RQATDNAVDR DAGEEVEQQT GAVILTAEPS ATEFWEGRAF LQRRDFYRRN
PFPGGNGDPG GIVTFERYYG GVGGRHVRRT EAAGRPVQVS VGTDAEWQYD DRQRYVNNNG
REGERTNDQR ERAQVVAGYL QADWEVAPRW RLVGGGRLDW TRLSIDDREA GLDTDSQTYT
EPSYLAGVRY AVAEDHSLYA KASSAFETPT LWELWDRDAG GIDDTIEPQQ ARSVEVGVKG
RALDRRLRYE LTLFQVRTED ELVPQEDADG PTRYANAGET RRRGVELGVE AFPTERLEVT
AALAYGQFTF RDFETSELEG VSGAEDPDQV RGNRIPGVPR AHGYLETAWQ APSGWRWAAD
VRASEGIWAD DRNTQRSSGY TEVGLHASRT FVTDAYEAEP FFGVNNLFDA TYDANVRINA
ANESDNSLED GGYFEPAPER TIYAGIRFAT Y