Gene Haur_2389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2389 
Symbol 
ID5734270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3043570 
End bp3044904 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content50% 
IMG OID641279530 
Producthypothetical protein 
Protein accessionYP_001545157 
Protein GI159898910 
COG category[S] Function unknown 
COG ID[COG4325] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATAA CTCGCGAACT CGAAAATCAG TATCATAAGC TGGATTCATC GTTATGGTTT 
CGGCCCACGC TCATGGCAAT TGGCTCGGCT ATTTTGGCCT TTTTCACAGT TGAACTTGAT
CGAGTTTTCG ATTTTGATCA TGTGGCTTTT CTACGGGCTG GGATCGACGA TGCGCGGGCG
ATTCTCTCTT CTGTCACCAG TTCGATGCTC ACCGTCACCA CCGTTACCTT TTCGATCATT
ATGGTAGCCT TGGTACTAGC GTCGCAGCAA TTTTCGCCGC GAATTATTCG TAATGTGATG
CGCGATACGC CTTCGCAATA TGTGCTGGGC ACATTTATTG GCACATTTAT TTATAGTTTG
CTGGTGCTGG GCCAGATTAA CGATCAAGCA TCGTTTGTCT TTGTGCCGAT TCTCTCACTT
GCCACTAGCA TTATGTTAAC TCTCTTGAGT ATTGTGGCCT TTATTTATTT TGTACACCAC
ATCGCCGAGA CGATTCAAGC CAGTGTCTTG ATTGCCCGCG CTGCCGAACG CACAATCGAT
GTGTTGGATC GGCGCTTCCC CGAAACACTG GGCCATGCGA TGGAGCAAAT TCCGCCACCG
CCAATCCCCA ATGAAACCCC AACCACGATT TACAATGCCA AGGGTGGCTA TATTCAGGCG
ATCGATCCTG TGCCTTTGTT GGAGCTAGCT CAGCGCTTCG ATGTGGTGAT TTATATGGAT
CGGGCGGTCG GCGATTTTGT GCCAACTGGC AATCCACTCT TACACATGGT TCCGCAACGT
GAGCTTGATC CCGATAGCAT CGCTGAATTT CAAGATGTGT TTGAGATTGG TTTAGAACGA
ACGTTGTTTG ATGATGTGTT GTTTGGCATT CGCCAGCTCG TGGATATTGC GCTCAAAGCG
ATTTCGCCTG CGGTCAATGA CCCCAGCACC GCGATTAATG CGATCGATTT GTTGAGCGAT
GTACTGGCGC AGGCCATTCG TCGCCCTGAG CAATCGCCAT GTCGCTACGA CGAATTTGAT
CAGCTACGGG TGGTTGCGAA TACAATTACA TTTCGCCAGA TGTTAGGCAC TGCGCTCAAC
CAAATTCGCC AATATGCCAA AGGCGAAATC GCCGTAACTG CCCGTTTGTT GGTGTTACTG
AATGAAGTTG CGCTAGCATG TAACGATCAA GAACGTCGGG CCATGCTCTG GGAGCAAGCC
TGCATCATCA CGCGGGGAGC CGATCAAGCC ATCACTGAGC CATTCGATCG CGCCTATATC
AACGAACATT TGCTGACGCT TGCCAATACA CTAGCAATCG CCTCTGAGCA ACGCATCACG
CTGAAAGTTG GCTAA
 
Protein sequence
MRITRELENQ YHKLDSSLWF RPTLMAIGSA ILAFFTVELD RVFDFDHVAF LRAGIDDARA 
ILSSVTSSML TVTTVTFSII MVALVLASQQ FSPRIIRNVM RDTPSQYVLG TFIGTFIYSL
LVLGQINDQA SFVFVPILSL ATSIMLTLLS IVAFIYFVHH IAETIQASVL IARAAERTID
VLDRRFPETL GHAMEQIPPP PIPNETPTTI YNAKGGYIQA IDPVPLLELA QRFDVVIYMD
RAVGDFVPTG NPLLHMVPQR ELDPDSIAEF QDVFEIGLER TLFDDVLFGI RQLVDIALKA
ISPAVNDPST AINAIDLLSD VLAQAIRRPE QSPCRYDEFD QLRVVANTIT FRQMLGTALN
QIRQYAKGEI AVTARLLVLL NEVALACNDQ ERRAMLWEQA CIITRGADQA ITEPFDRAYI
NEHLLTLANT LAIASEQRIT LKVG