Gene Haur_3761 details

Gene Information

Locus tag: Haur_3761
Symbol:
ID: 5735625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4730823
End bp: 4732277
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 53%
IMG OID: 641280913
Product: peptidase
Protein accession: YP_001546525
Protein GI: 159900278
COG category: [S] Function unknown
COG ID: [COG3182] Uncharacterized iron-regulated membrane protein
TIGRFAM ID:
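
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4730823, 4732277)
print(length, protein_length_aa(length))  # prints: 1455 484
```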


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACTA CCACAACCAC CAACGACGGA GCAATCACCG ACGAACGCCG TTCACAATCT 
GTTTTCTATC GCGCAATCTG GCGTTGGCAT TTCTATGCCG GTTTGTTTGT TGTGCCGCTG
ATGATTGTCC TAGCCGTAAC TGGCAGCATT TATCTATTCA AGCCCCAACT TGACCGTTTG
ATGTACGGCG ATTTGATGCA TGTCCAACAC ACCAGTGGCT CGGCGCAAAG CTACACTAGC
CAATTGGCCG CTGCCCAAGC TGCTTATCCC GCTGCCAGCG TTGGCAAAAT TCGCCCAAGC
GATGCCCACG ATCGTAGCAC TGAAATTAGT ATGAGCACCA GCGATGGCCG TAATCTGACG
GTTTTCGTCA ATCCCTATAC CAATCAAGTG CTCGGCGAGC GCGATGAAGA TTGGAATTTA
CAAACGATTG CCTTGAAATT ACACGGCGAG TTGCTGATTG GCACAACGGG CGATCGGATT
ATTGAATTAG CCGCTTGCTG GGCGATTTTG CTGACTCTTT CGGGGTTGTA TCTCTGGTGG
CCACGCTCGA AAAGTGGCAT CTGGGGCACG TGGCTGCCAC GCTTGCGCAG CAAAAACAAA
CGGATTTTCT GGCGCGATTT GCATGCCGTG CCTGGCATGT ATGCCTCGTT GATTGTGCTG
TTTTTGCTGA TTTCGGGCTT GCCGTGGACT GGCTATTGGG GTGATAAATT TGCTAATGTT
TGGAGCGGTT ACCCCAATCA ACTTTGGAGC AATATTCCTG AATCTACAGT ATTGACTGGC
AGCCTCAACA CCACAACCGA CAAAGTTGTG CCGTGGGCAG TTGAACAAGC GCCATTGCCT
CAATCTGACC CTGATCATGC TGAGCATCGT GGTGATGGAG CGAGTGCGCC GGTGCCAAGC
AGCGCTACCG AAGGCCCACA AGCCGCAACG CCCGTCACGC TTGATTCGGT GATTGAAGTC
GCCAAAGCCC GTGGCGTGAT TGCCAGCTTT ACGGTTACGC CACCTGATGG AGAAAAAGGC
GTGTACACCA TCGCTGCGGT GGCGAATGAT CCTGCTGATG AAGCCACAAT TCACGTTGAT
CAATATAGCG GAGCGATTTT GGCCGACATT CGCTGGCGTG ATTATGCCAT GGTTCCCAAA
GCTGTTAGCA TGGGTATTTC GCTGCACGAA GGTAAATATT TCGGGCTGGC TAACCAACTT
TTAGCCTTGT TTGGGGCCAT GACGGTGCTG TTGCTATCGG TTTCGGGCGT GGTGTTGTGG
TGGAAACGCC GCCCTGAGGG CCGCTTGGGT GCGCCCAATT TGCCAGCCAA CTTCCCCCAC
TGGAAGCCTG TGTTGCTGAT GGTGGTGCTG GCGAGCTTGG CCTTCCCCTT GGTTGGCGCT
TCGTTGTTGT TTATGCTGGT GTTGGACCTG ACGGTGTTTC GGTTTGCGCC AAGCCTCAAG
CAACGCCTTG CTTAG
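
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of that step; the helper names `join_blocks` and `gc_percent` are illustrative assumptions, and only the first row of the sequence is used as a short demonstration.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used only as a short demonstration.
first_row = "ATGACGACTA CCACAACCAC CAACGACGGA GCAATCACCG ACGAACGCCG TTCACAATCT"
seq = join_blocks(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field.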
 
Protein sequence
MTTTTTTNDG AITDERRSQS VFYRAIWRWH FYAGLFVVPL MIVLAVTGSI YLFKPQLDRL 
MYGDLMHVQH TSGSAQSYTS QLAAAQAAYP AASVGKIRPS DAHDRSTEIS MSTSDGRNLT
VFVNPYTNQV LGERDEDWNL QTIALKLHGE LLIGTTGDRI IELAACWAIL LTLSGLYLWW
PRSKSGIWGT WLPRLRSKNK RIFWRDLHAV PGMYASLIVL FLLISGLPWT GYWGDKFANV
WSGYPNQLWS NIPESTVLTG SLNTTTDKVV PWAVEQAPLP QSDPDHAEHR GDGASAPVPS
SATEGPQAAT PVTLDSVIEV AKARGVIASF TVTPPDGEKG VYTIAAVAND PADEATIHVD
QYSGAILADI RWRDYAMVPK AVSMGISLHE GKYFGLANQL LALFGAMTVL LLSVSGVVLW
WKRRPEGRLG APNLPANFPH WKPVLLMVVL ASLAFPLVGA SLLFMLVLDL TVFRFAPSLK
QRLA
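
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation using only the Python standard library. It assumes the amino-acid assignments of NCBI table 11 listed in TCAG codon order, and it does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG codon order ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGACGACTA CCACAACCAC CAACGACGGA"))  # prints: MTTTTTTNDG
print(translate("CAACGCCTTG CTTAG"))  # prints: QRLA
```

Both outputs match the first block and the final residues of the protein sequence listed above.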