Gene Haur_3849 details


Gene Information

Locus tag: Haur_3849
Symbol:
ID: 5735714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4832750
End bp: 4835197
Gene Length: 2448 bp
Protein Length: 815 aa
Translation table: 11
GC content: 53%
IMG OID: 641281002
Product: hypothetical protein
Protein accession: YP_001546613
Protein GI: 159900366
COG category: [S] Function unknown
COG ID: [COG4485] Predicted membrane protein
TIGRFAM ID:
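
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4832750, 4835197)
print(length)                     # 2448
print(protein_length_aa(length))  # 815
```

Both results match the Gene Length and Protein Length fields.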


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCCA TGCAACCAAG CCTATTTTTG CGCCAGTGGC AACGCTGGTG GCCGTATCTC 
AGCATTACCT TTGTCGCCCT GCTTTTGCTG TGGCGGGTGG TGCTGGGCAA CATTTTCTTG
CCGCTGGATA TCGTGGCCCA CCTGCATCCT TGGCGCTTTT CCTACGAACG GGTGGCGGTC
AATAATCCAA TCAATAGCGA TCTGGTTACC CAAATTTACC CGCGCCGCTT GGTAACCAAC
CAGATTCTTG AGCAAGGCGC GTTGCCCCTA TGGAACCCGA CGATTTTAAC TGGCACGCCG
TTGTTGGCCG ATGGTCAGTT GGCCTTTTTC TACCCTTTGA GTTGGCTGTT TGTGCTGCTG
CCAGTTGGCT ATGCCTTTGG GATTTATACG CTGTTGAATG TGTGGTTGGC GGGGATTGGC
ACGTTTAAAT TTGCTCAACG CATGCAACTT GAGCCAATGC CAGCAACGCT CGCTGCCGTG
GGCTATATGC TCAGTGGCTT TTTACTCAAT TGGCTGCATT TTCCCGAGTT TAGTGCGGCT
TGTGCCATGC TGCCGTGGTG TTTTTGGGCG GTGCTGCGGG CCTGCCAAAG CCAACGTTGG
CACGATTGGC TGCTCGCCAG TTTGGTGCTG GCCTTGCCGT TGGTCAGCCA AATTCAACTA
GCCTTCTATG TGTATGTCGG GGTTGGCTGT TTGCTGCTGG CTCAACTGTT GGCGTTGCCA
ACTTGGCGCT TACGCTTCCA ACAAATCGGC CAATTTAGCA GTGCAATTGG CTTGGCGCTC
GGATTGAGTG CGGTGCAGTT GTTGCCACAA ATTGCCCTTT CGGCTCAGGG CCAACGCCTT
GATATTGGCT CAGGGCTTGG CTCGGCCAGT TCAATCATGG TGTGGCTGCT GCGCTTGGCG
TTGCCGATTG TTGATGGAGC CGCCCGCGAA ACTGCTAGCG CATGGCAACC ACATTTGTTG
CAGGGCATTC AACCCTATGC AGGCATCGTA AGTTTGGCCT TGGCAGGCTT GGCGATTTGG
CGCAGCAAAC AGCCTGGCGT GAGGCTGTTT GCCTGCTTGG CGCTTGGCTC GTTTGCGGTG
GCGATTGGCA CACCGTTGCT CCAATTGCTG CTTTGGTTGG TTCCGCCCTA TCGCCAATTT
GCTGATCATC AGCGGTGGTT TAGCCTGTGG GGTTTTGCGA TAGCCCTGTT GGCAGGCTTT
GGCTTACAAC GTTTGCAGCA ACCGAGCAAC AAGCCAAATC GGGCGCTCTG GGTGCAACGC
GGCCTCTTGC TGCTTGGATT AATTGGTGTG GCTGGCTGGG CTTTGCAACA TATCGCCTTA
TTCACTGTCG ATTCGCGTTA TGCCCAATAT AGCACCATGC TGCGTTTGGC ACTCAACCCA
ACCAGCCTTG CTATTCTGGG CTTGAGTGGT TTGGCATTGG TAGGGTTGTT GATCAAACGC
ATTCCACGGC GTTGGAGCAA TCTTGCTGTT TTGTTGATTC TGCTGGGCGA TTTGCTTTGG
TATGGTGGCA GCTACAACAC CAGCATTGAT CCTGCAATCT TCCAGCCAAC CGCTGATCAG
CAAGCCAGTT TGGCTGCCGA GCCAGCCTTG CAAGATCCGG CGATTCTGTA TCCGCCAACT
CGCCAGATCA ATTTTCTGCT GAGCCAACCC GGGGTGTTTC GCGTGTTTGG AGCCGATTAT
CAAGCCATGC CGACGAATGT GTTTAGTGCT TTTGGACTTG AGGATATTCG CGGCTATCAA
TCGCTCTATT TAGCCCAATA CAATCGGCTA ACGCGCCTAA TGGATGGCAA GGATTATCAT
AAACTTGGCG AGGGTGGCAA CAGCCTACAC GCGTATTTCA CCATGGCCTA CAACCAGCGG
CGTTTGCTGA ATATGCTGAA TGTGGAATAT CTGATTTTTA CGCCTAATAG CCCCAACCCT
GAGTTGTATC AACCCTTAGA ATTGGTGCAA CGTAACGATG AAGGCACGAT TTATCGCAAT
CCTGAGGTGT TGCCACGCGC CTGGATGGTC TATCAAACCG AGGTAATTAG CGACGAATTA
GCCCAACTTG ATCGGCTGGC AGCTAACGAT TTTGACCCAG CCAAGCAAGC AATTGTGGCC
GAGCCAATTC CGGCGCTTGG CCAAGCACCG AGCCAAACGC TGACTCCTAC GGTGAGCTAT
GAGCCAAATC GGGCGCTGGT GCAGGTCGAA ACTTCAGCGG CAGGTTTGTT GGTTTTGGCT
GATGCCTACA CCAACGATTG GCAAGTAAGC GTTGACGGCC AAACAGCTCA ACTCTATCGC
ACCAATTATG CTTTGCGCGG CGTATGGGTT GATGCAGGCC AACACACAGT CGAATTCAGC
TATCGACCCA AGAGCTTAAT CGTTGGTGGT TGGGTTAGTG GCCTAAGTTT AGCCCTGATT
TTGCTTGGCC TAGCTTTGAG CTGGTATAAA ACGAGAAAGG CTGCATAA
 
Protein sequence
MSAMQPSLFL RQWQRWWPYL SITFVALLLL WRVVLGNIFL PLDIVAHLHP WRFSYERVAV 
NNPINSDLVT QIYPRRLVTN QILEQGALPL WNPTILTGTP LLADGQLAFF YPLSWLFVLL
PVGYAFGIYT LLNVWLAGIG TFKFAQRMQL EPMPATLAAV GYMLSGFLLN WLHFPEFSAA
CAMLPWCFWA VLRACQSQRW HDWLLASLVL ALPLVSQIQL AFYVYVGVGC LLLAQLLALP
TWRLRFQQIG QFSSAIGLAL GLSAVQLLPQ IALSAQGQRL DIGSGLGSAS SIMVWLLRLA
LPIVDGAARE TASAWQPHLL QGIQPYAGIV SLALAGLAIW RSKQPGVRLF ACLALGSFAV
AIGTPLLQLL LWLVPPYRQF ADHQRWFSLW GFAIALLAGF GLQRLQQPSN KPNRALWVQR
GLLLLGLIGV AGWALQHIAL FTVDSRYAQY STMLRLALNP TSLAILGLSG LALVGLLIKR
IPRRWSNLAV LLILLGDLLW YGGSYNTSID PAIFQPTADQ QASLAAEPAL QDPAILYPPT
RQINFLLSQP GVFRVFGADY QAMPTNVFSA FGLEDIRGYQ SLYLAQYNRL TRLMDGKDYH
KLGEGGNSLH AYFTMAYNQR RLLNMLNVEY LIFTPNSPNP ELYQPLELVQ RNDEGTIYRN
PEVLPRAWMV YQTEVISDEL AQLDRLAAND FDPAKQAIVA EPIPALGQAP SQTLTPTVSY
EPNRALVQVE TSAAGLLVLA DAYTNDWQVS VDGQTAQLYR TNYALRGVWV DAGQHTVEFS
YRPKSLIVGG WVSGLSLALI LLGLALSWYK TRKAA
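
The sequences above are printed in space-separated blocks of ten characters, so any downstream use has to strip the whitespace first. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It builds the codon-to-amino-acid mapping of NCBI translation table 11 by hand rather than using a library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the sketch ignores the table 11 rule that alternative start codons are read as methionine, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, codons in the order
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_blocks = "ATGAGCGCCA TGCAACCAAG CCTATTTTTG"
print(translate(first_blocks))  # MSAMQPSLFL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would let the GC content and Protein Length fields be checked in the same way.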