Gene Haur_1454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1454 
Symbol 
ID5736865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1692575 
End bp1693954 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content51% 
IMG OID641278592 
Producthypothetical protein 
Protein accessionYP_001544226 
Protein GI159897979 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000963301 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCCAC GTTCGACCGT GCGGCGCAGC ATGGCATTAA TCATTCTAGT AATGCTTTTA 
ACGCTGGTTG TCGCCCGACC CCAAGCTTCT AATGCTGCCG ACCCACCAAC TTATTATGCT
GAAACCGGTC ACTACCTTGG TGGTGGTTTC CGCGATTACT GGAATGCCAA CGGCGGCTTA
CAAATTTTTG GCTACCCCAT CACTGAAGAA TATCGCAACG CACAGGGCAA AACCATTCAA
TGGTTTGAAC GTGCTCGCTT TGAGTTAGCC AGCAATGGCT CAGTCGAGTT AGGGCTTTTG
GGGCGTGAAG CAACCGTTAA TCGGGTGTTT CCGCAAATCC CGCCCCGCGA AAACGATGCC
AACCACCGCT ACTTCCCCGA AACCAGCCAT ATGATTATGT GGGGGTTCAA AACCATTTGG
GAAACCAAAG GTGGTTTAGG CGTATTTGGC TATCCAATTA GCGAAGAAAT GGATGAAATT
CTCGCTTCGG ATAACAAATG GCATATCGTT CAATATTTCG AGCGTGCTCG CTTTGAATTT
TGGCCCGATT ACCCTGCGGG GCAACGGGTT GTCGTCAGCG ATCTAGGTCG GCGGTTGGCT
CCCCGCGAGT TAACCACGCC ATTGCCACCA GGCAGCCCTC CAGGCAGTAC CCCTCCTGGC
GCACCTGGGT TGCCACCAAG CAAAGATGCG ATCGTTACGC CAAGCTCTGG GCCTGAAGGG
ACAACCTTTG GCTTCAATGG TTTTGGCTTT GTCGCCGGTG AAGAAGTGGT TTTGTGGTTG
ACCTCACCTG ATGGTACAGT CTACCCGGCC AATTCAACAA CCTATGCTGA TATCGATGGG
TCGTTAACCT CATCAGGTAT TTATGTCACA ATCAGCCAAG GAGTTGGGGT TTGGGCGATT
ACCGCCCAAG GTCGGCTCAG CGGCCATGCC AGCATTGGCT ATTTTGAAGT TACCCGAGCA
CCTGAGCAGC CACTGCCCGC CGACTATAAT GCACGGGTTG ACCCTCGTGA AGGTCGGCAG
GATACGATTT ACAACTTCTA TGCAGGCGGG TTTGTTCCAG GCGAAGTAGT TGCGGTTGGT
GTACTCAACG AATATGACGA ACTCGTAACC GAAGTTATTG GCGTTTATGC TGATGGTAAT
GGCTCAATCG ATTATGCCAA TATTCGCTTT GTGCCAAACA ATTCCTTCGA TCCAGGTATT
TATGAAATCT ATTCCACCAG TGAAAGTGGC CGTGAAGCCT ATGCCTTCTT GCGTATGCGC
AGCAATAGTG TGACCAGTGT CTCGACCTTA AGCATGCGTC AAGCGCGAAC CACCAGCGGT
TCATTAGGCC GTGGCGATGG GCTAGCCAGC GAAGGCAATA TCGATTTCTT CCAGAAATAG
 
Protein sequence
MLPRSTVRRS MALIILVMLL TLVVARPQAS NAADPPTYYA ETGHYLGGGF RDYWNANGGL 
QIFGYPITEE YRNAQGKTIQ WFERARFELA SNGSVELGLL GREATVNRVF PQIPPRENDA
NHRYFPETSH MIMWGFKTIW ETKGGLGVFG YPISEEMDEI LASDNKWHIV QYFERARFEF
WPDYPAGQRV VVSDLGRRLA PRELTTPLPP GSPPGSTPPG APGLPPSKDA IVTPSSGPEG
TTFGFNGFGF VAGEEVVLWL TSPDGTVYPA NSTTYADIDG SLTSSGIYVT ISQGVGVWAI
TAQGRLSGHA SIGYFEVTRA PEQPLPADYN ARVDPREGRQ DTIYNFYAGG FVPGEVVAVG
VLNEYDELVT EVIGVYADGN GSIDYANIRF VPNNSFDPGI YEIYSTSESG REAYAFLRMR
SNSVTSVSTL SMRQARTTSG SLGRGDGLAS EGNIDFFQK