Gene Hhal_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1043 
Symbol 
ID4709797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1127722 
End bp1130067 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content68% 
IMG OID639855514 
Productvon Willebrand factor, type A 
Protein accessionYP_001002621 
Protein GI121997834 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4548] Nitric oxide reductase activation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTGC GTCTCGAGGA CTACCGCGAG ACCCTGGTGG CGGCGGATCC GCGGATCGCC 
GAGACCCTGG AGGCGAGCTT CGCCGAGGCG TCGCGGGTGA TGTCGCCGCG GGGGTTGCAC
AACTACCTGG AGGGTGCGCG GGCCCTGGCC GAACTCGGCC GCGGTGCCGA CCTGGTCATC
GGTTATCTGG AGGCGATGCC GGCGGTGGCC AAGGAGGTCG GTGAGGACGT CCTGCCCGAG
GTGGTCACCG CCGCCATGAA GCTCTCCTCC ATGGTCAGCG GCCAGGTGAT CGCCCTGCTG
CTGGCCGGGC TGCCCACGGC CGCGCGGCGG CTCGGCGATC CGGACTTGCT GCGCCAGTTC
CTCAACCTGG TCCATCAGCT CTCGGCCAAG GCCCCGCGCG GCCTGCGGCC GATGCTCGAG
AACCTCGACG AGCTCTTCAC CAAGCTGACC CTCGGCGGAT TCCGACGCTG GGCGCTGTGG
GGGGCCCAGG CCCACGCCCG CGACTTCGAG CAGCAGCGGG GTTACTTTGG CCTGCAGACC
GCCGACAGCC AGGCGATGCT CCAGCAGGAG CGCCGCGGCA CCCTGCTGGT GGACAACCAG
CGCAAACTCA ACTTCTATCT GCGGGCCCTG TGGGCGCGTG ACTTCTTCAT GCGCCCGACC
TCGGGCGACT TCGAGACCCG CGAGGGGTAC AAGCCGTTCA TCGAGCAGCG GGTGATCCAC
CTGCCCGATG CCTACGACGA TTACCACGGA CTGCCCGGCA AGGAGCTCTA CCGAGCCGCG
GCCGCCCACG CCGCCGCGCA CCTGTTCTAC ACCACGGAGC CCTTGTCGCC GGAGATGCTC
AATCCGGCGC AGATGGCCGT GATCGGTCTG ATCGAGGATG CCCGGATCGA GGCGCTGGCC
ATCCACGAGT TTCCGGGGCT GCAGCGACTC TGGCAGCCCT TCCACCAGGC CCGGGCCAGG
GAGCGCGCCG GCGAGGACCC GGACCCGGTG GTCGAGCGCC TGGAACGGGC CGCCCACGCC
CTCATCGACC CGGAGTACGC CGACGAGGAT CCCTGGGTGC AGCAGGTCCG CGAACTCTTC
GCCTACCACT TCGCGGATCG GCCACGGGAT GTGGGGCTCT CCTGGGAGTT GGGCATGGAG
CTGCACAACA GCCTGGCCCA GCGCTATTCC ATGCCTTCGG CCCGGGTCCT GGAGCGGGTC
GCCGTTCCCT ACCGCGATGA CAATCGCTAT CTCTTCGCCA GCGACGAGGA CGAGTGGCTC
GAGGCCGAAT ATGTCCCGGC CAGCCACCGT CAGGTGCGCC GCCAGGTCAA CCTGATGGAG
TTCGTCAACG AGGTCGATTG CGAGCTCGCC GGCGACGACG CCCAGGAGGT CTGGGTCCTC
GGCACGGAGC TCTTCCCCTA CGAGGACTAC GGGGTGAGCT ACAACGAGAT GGAAGGGGTG
GAGCCGGTCA GCCCGCCGTT CCACTACCCG GAGTGGGATT ACCAGGTCCA GCTCAATCGC
CCCGACTGGG TCACGGTGGT CGAGCGCCGT CCGAAACGCG GCGACCCCGA GGTCATGGAT
CAGGTCCTCA AGGACTACCG CCCGGTGGCC AGTCGGCTGC GCTATCTCAT CGACGCCCTG
CAGCCCCAGG GAGTGATCCG CGAGCGGCGG CAGGAGGACG GCGACGAGCT GGACATCGAC
GCGGCGGTGC GCTCCATGGT GGATCTGCGC CTGGGCATGA GCCCGGATCC GCGCATCAAC
ACCCGGTATA TCCGCAAGAC GCGGGATCTC TCGGTGCTGC TGCTGCTCGA CCTCTCCGAG
TCCACCAACG ACCCGATGGG CGGTAGCGAG AAGACCGTCA TCGAGTTGAC CCGGGAGGCC
ACGTCGTTGC TCGGCTGGGC CATCAACGGC ATCGGTGACC CCTTCGCCGT GCACGGCTTC
TCTTCGGACG GTCGGCACGA TGTGCAGTAT TACCGGTTCA AGGACTTCCA TCAGCCGTGG
GGCGAAGAGG CGAAGTCCCG CCTGGCCGGC ATGCGCGGGC AGTACTCGAC TCGCATGGGC
GCGGCAATGC GCCACGCCGG GGCCCACTTG GTCCGCCAAC CGCAGCGACG CAAGCTGTTG
CTGATTGTCA CCGACGGAGA GCCCCACGAC GTCGATGTGC GTGATCCGCA GTACCTGCGC
CACGATGCCC GCAAGGCCGC CGAGGAGCTG TCCGCCCGGG GAGTCACCAG CTACTGCCTG
ACCCTGGACC GGGATGCGGA CGCCTACGTG TCACGGATCT TCGGGGCCAA CGGGTACTCG
GTGGTCGAGC AGGTCGAGCG CCTGCCGGAG CGTCTGCCGG CGGTCTTTGC CCAGCTGACC
CGGTAG
 
Protein sequence
MTVRLEDYRE TLVAADPRIA ETLEASFAEA SRVMSPRGLH NYLEGARALA ELGRGADLVI 
GYLEAMPAVA KEVGEDVLPE VVTAAMKLSS MVSGQVIALL LAGLPTAARR LGDPDLLRQF
LNLVHQLSAK APRGLRPMLE NLDELFTKLT LGGFRRWALW GAQAHARDFE QQRGYFGLQT
ADSQAMLQQE RRGTLLVDNQ RKLNFYLRAL WARDFFMRPT SGDFETREGY KPFIEQRVIH
LPDAYDDYHG LPGKELYRAA AAHAAAHLFY TTEPLSPEML NPAQMAVIGL IEDARIEALA
IHEFPGLQRL WQPFHQARAR ERAGEDPDPV VERLERAAHA LIDPEYADED PWVQQVRELF
AYHFADRPRD VGLSWELGME LHNSLAQRYS MPSARVLERV AVPYRDDNRY LFASDEDEWL
EAEYVPASHR QVRRQVNLME FVNEVDCELA GDDAQEVWVL GTELFPYEDY GVSYNEMEGV
EPVSPPFHYP EWDYQVQLNR PDWVTVVERR PKRGDPEVMD QVLKDYRPVA SRLRYLIDAL
QPQGVIRERR QEDGDELDID AAVRSMVDLR LGMSPDPRIN TRYIRKTRDL SVLLLLDLSE
STNDPMGGSE KTVIELTREA TSLLGWAING IGDPFAVHGF SSDGRHDVQY YRFKDFHQPW
GEEAKSRLAG MRGQYSTRMG AAMRHAGAHL VRQPQRRKLL LIVTDGEPHD VDVRDPQYLR
HDARKAAEEL SARGVTSYCL TLDRDADAYV SRIFGANGYS VVEQVERLPE RLPAVFAQLT
R