Gene Haur_3634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3634 
Symbol 
ID5735495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4568530 
End bp4570497 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content53% 
IMG OID641280783 
Producthypothetical protein 
Protein accessionYP_001546398 
Protein GI159900151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACTG GCAAAACTGA ACTGATCACG ATTGCCCTTG CTGATCCAAT TGATAGCATT 
ATTCAGCAAG TGCGCAATGC CAAAGCAGAG CATGTTGATA TTTTCATGCC CGAAGGCTCG
ATTTCGTTAC AAAGCCGTAA AACCTGTGAT CGCTTGCGCG AAACTGCCAA TCGCGAGGGC
ATTGAGTTAA CGCTCTATAC CAGCGATCCC AAGGTGGTTA AGGCGGCTTT GACTTGCCAG
ATGATGGTGG TCGAAATTGA GCCAGCCGTT GTTTCAACCA AAGCGCCTGC TGCTCCGCCA
GCTCCAGCTA AACCAGTTGC ACCAGCTAGC CCTGAAGAAG ATTTTTTGGC TTCGCTGGAA
GGGTTGCCCT CTGCTTCCAC CACCGATCGG TTGCAAACCA GCCCCAGCAT TAGCAAGCCG
ATTCCAACGG CAGCCCCCAG TCAACCAGCC CGCAGCGAAA TCGACGATTG GGCTGATGCC
CTTGATAGCC TTTCAGTGGC TTCGACTGCT GGCGAAACTG GCTTGCGCCA ACATGAACAA
GCGCGTGGCA ACGATAGCTG GGATTTTGAT TCATTCAACG ATCTTTCTGA TGCGCTGAGT
GGCGATACGG CCAGCGCTGC GCCAGCTCGC CCACGGATTC GCCCCGAGGA TATTGAATTA
ACTAGCGATG ATATGCATCG CCAAGATGCC AAAGCTCGCC GTGCTCGGGC CGAGCAAAAA
CGCGCTGACG AAAAGGCTGA ACTCAAACAA CAAACCGCGC CCAAACGCAA TTATCCCTTG
TGGGGTATGG TTATTCTAGG GGCGATTGCA ATTATCGCGT TGTTATGGCT GCTGTTTGGC
AATAAATTGG CCAGCGGCGT AACCGTGGTG GTGCGACCAG CTCCAACCTC TGGCGGCCAA
ACCTACGAAG ATGTGCGGCT CAACTATCAA GCCGACCCAA TTAGCGAACC AAGCTCGGCG
GCAATTCAAG GGCGCTTGAT CAACGTGCCA ATTTCGGTGA GCGTGCGTGG TACGGTCATT
ACGGCAACCG AACAACCTGA TCAAGCGGCG ACTGGCATCC TCGAAGTTTA TAACCGCAAC
ACCCAGGCCT ACCCAATTCC TGCTAATACG CGGGTTCGAG TGCTCAACTC GGCTGGCGAA
GAAATTATCT TTTTGACCCA AGCCGAAACG ACGATTCCTG CCGCAACAGG CAACTTTGCC
GGAATTGTCA GCGGGGTTGG TTCATTGCAA ATCGTGGCCA GTGCTGGTGG TACGCGCTAC
AACATGCCTG CTAGCCAAGA TCCGGTTTGG ACGATTGAAG GCTATGAAGG TGCACTCTTT
TCGATCAATC CTGAGCCAAT TCAAGGCGGC ACAACTGCGC TGCTAAAATT GCCAGTCGAA
AGCGATTGGA TGCCGTTACT GCCCCAAGCA GTAGCCCAAT TCCGCAGTGC GATTCCGGCT
CAAATGCAAA CCGTGCTACA AGAAGGCGAA GTCTTGGCTG ATGTTGAATT TTTGCCGAAT
GTCGATGCCT TGACTCAAGA CCCTTCGCTT TACGATATTC AAACCCGACC AGTGCCCGAA
ACCGATGGCG GTTTTGAGTT GGTCGTAACA GCGAATTTCC AAGGCTTGGC GGTGCGTGGC
AGTTTTGCCG AGCAGTTGAA TCGAGCTTTG CCGAATGCCT TGCGGATCAA AGACCCATCA
TTCAGTACCG ATACCACCAG CATTGTGCGT AGCCAAGTGC GCTTGGATGA TTCAGGTGCT
AGTTTGTTGC TGGCAACGGT CGAAGTTGCT CCCAAAACCG CAGTTGGTTT ACCCGATAGC
ACTAAACTGA AAATCGCTGA TAGCATCAAA GGCCTAACCC CAGCCGAAGC CTTGGTGCAG
TTGGAAGCTT TGCGCGAAAG TGGCTTGATT GGCGATATTG TGTCAGTGCC CGAAGTTGAG
CGCTTGCCCG TTGATCCAGC CGAAATTAAT ATTCAGGTGC AGGAATGA
 
Protein sequence
MTTGKTELIT IALADPIDSI IQQVRNAKAE HVDIFMPEGS ISLQSRKTCD RLRETANREG 
IELTLYTSDP KVVKAALTCQ MMVVEIEPAV VSTKAPAAPP APAKPVAPAS PEEDFLASLE
GLPSASTTDR LQTSPSISKP IPTAAPSQPA RSEIDDWADA LDSLSVASTA GETGLRQHEQ
ARGNDSWDFD SFNDLSDALS GDTASAAPAR PRIRPEDIEL TSDDMHRQDA KARRARAEQK
RADEKAELKQ QTAPKRNYPL WGMVILGAIA IIALLWLLFG NKLASGVTVV VRPAPTSGGQ
TYEDVRLNYQ ADPISEPSSA AIQGRLINVP ISVSVRGTVI TATEQPDQAA TGILEVYNRN
TQAYPIPANT RVRVLNSAGE EIIFLTQAET TIPAATGNFA GIVSGVGSLQ IVASAGGTRY
NMPASQDPVW TIEGYEGALF SINPEPIQGG TTALLKLPVE SDWMPLLPQA VAQFRSAIPA
QMQTVLQEGE VLADVEFLPN VDALTQDPSL YDIQTRPVPE TDGGFELVVT ANFQGLAVRG
SFAEQLNRAL PNALRIKDPS FSTDTTSIVR SQVRLDDSGA SLLLATVEVA PKTAVGLPDS
TKLKIADSIK GLTPAEALVQ LEALRESGLI GDIVSVPEVE RLPVDPAEIN IQVQE