Gene Haur_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1413 
Symbol 
ID5733321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1628208 
End bp1630442 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content50% 
IMG OID641278551 
Productband 7 protein 
Protein accessionYP_001544185 
Protein GI159897938 
COG category[S] Function unknown 
COG ID[COG2268] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGA TTGCAGTGTT AGTCGGGATT ACCCTACTGG TCGTGATTCT GTTGGTTGTG 
GTGTTTTTTG CCTTGGTCAA CCATTTTTAT GTGAAAGCAC CAGCCGACCG TACCTATGTG
CGAACTGGTG GCAGCAAACC AAAGGTTGTT TTCAATGGTG GTAGCTGGGT GATTCCAGCT
TTTCATGAAA TTACCTGGGT TGATTTGCGC ACGATGGATA TTGATGTCGA GCGGACGGAA
GCCAATGCGT TGCTGACGAT CGACCCCCAA TATGCCGATA TTCGGGCAAT TTTCTTTATC
AAAGTTAATC CAATTGCTGA AGATATTGAA CGCGCTGCCC GTACCATCGG TGGCAAAGAA
GTTAATACCG ATAGTGTAAA ACGTTTGGTT GAAAGTAAAT TGGAAGGTGC ACTGCGCGAC
GTAGCTGCTA CCTTTACCTT GATGTCATTG CACCAAGAGC GGGAAAAATT TGTTGAGCGA
GTGCAAAATT TGGTGCGTAG CGATTTAGCT GAAAATGGTT TGGTGCTTGA AGCAGTTTCA
ATTACCACAT TAAAGAGTGC TCGCCAAGGG AGTTTTGGCA CTGATGATGT GTTCGGGGCG
CAAGTGGCGC GGGCCAATGC CGAAGTTATT CAGCAAGCGC TCAAACAACG CAACGAAATT
GATCAAATGA CCCAAACTGA AATTGCCAAG CGCAATGCTA CTGCTGAGCA AGAACGCAAT
ACGATCGAGC GCCAAAAGCA ACTTGAAATT GCTCGACGTA ATGCTTCGAC TTCGCAAGAA
CAAAATGATA TTGAACGCTC CAGCGAATTG GAAATTACGC GGCGTAATGC TGATGTTGAT
CAAGAAAAAT TGAATTTAGA GCGAAACCTA TCGCAAGCTC GCGCCACCCA ACAACGTGAA
ATTTTGATTC GCGAATCTGA GGAACGCACT GCTGCTGAGC GAGTGGCTTA CGAACAGCAA
CAAGCTGCTG AATTAAGTCG GGTTGAAAAA GAACGCACGA TTGCCGAAGC TGAAAAGCTT
AAAGAACAAG CGGTCATGTT GGCCGAACAA CGCAAGCAAC AAGCGATTCA GTTGGCCGAA
CAAGAACGTC AGCGCGAAGT GCAGCGCAGT CAAGTTCTGC GTGAACAAGC TGTCCAAGTG
GCCGACCGCG AACGCCAAGT GGCCTTGGCT CAAGAGCAAG CCAAGCTCGA ACAAGCAGAA
AAAGAACGTT TGGCAATTGC TGCCGAACGT GAAGTAGCCG AGCAAGGCGT GGTTACGGTG
CAAGAACGTG CTGCCGCTGA ACGTGAAGCG CAAATTCAAA TTATCAATGC TGAACGTGAT
GCCAAGCGCG AAATTATCAA TCGTAAAAAT GAAGTTGAGC TTGAAACCTT CCGCCAAATT
AAGCAGGCCG AAGCCGATGC TGAAGCCTTG AACAAAAAGG CAACCGCCGA AGCAAGTGCC
GCAATCAAGA TGGCCGAAGC TCGCCGTACC GAAGCCCAAG CCATGTCCGA TGCGGAAATT
CTGCGGGCTG AAGCAACTAA AGCAACCGTT GCTTCCCAAG GTTTGGCCGA AGCTGAAGTG
ATCAAGGCCA AAGCTGATGC CGCCCGTGTC GAAGCTGAGG CAATTCGTGA GCGTGGTTTG
GCGGAAGCTG AAGCCGCTCG CGCCAAAGCC CTCGCCGAAG CCGAAGGTCA AAAAGCCTTG
GCCGAAGCTT TGGCTGCTCA CGCTGGGGTA GCCCAAGAGC TAGAACTTGA ACGGATTCGC
ATGCACGCCC AAGTTGAAAT TGGCGTGGCT CAAGCCAAAG CCATGGGCGA AGCGATGGCA
GCGATGGACT TCAAGCTTTA TGGTACGCCC GAAACTGCTC AACAAATTCT GCGCATGATA
GGTTTGGCCG ACGGCGTTGG CAGTTTGATC AACACCGCAC CTGCTCCATT AAAAGAGCTT
GGCAATCGTT TGATCAACCG TGTGTTGCCA GCGAATGGCA ATGGCGATGC TGAAAAATCG
GCTAGCAACG ACAATGGCTT GAATCTGACC GCTGCCCAAC CAGTGTTGCG CGAAGCAGCC
TTGATTGCCA GCCAATATTT GAGTGCCGAT GAGTTGCAAA CTTTGACGGT TGGTGCAGCC
CTCGAACAAG TCTTGGGTGT CGCTAGTGAA GAGCAACAAG CAGTGTTACA TAAAGTCCAG
GGTATGCTGC AATTGATGCC CCAATTGGCT GACCAACCAT TGAGCAGTGT CTTGATGTTG
GTGCAAAATA GCTAA
 
Protein sequence
MEQIAVLVGI TLLVVILLVV VFFALVNHFY VKAPADRTYV RTGGSKPKVV FNGGSWVIPA 
FHEITWVDLR TMDIDVERTE ANALLTIDPQ YADIRAIFFI KVNPIAEDIE RAARTIGGKE
VNTDSVKRLV ESKLEGALRD VAATFTLMSL HQEREKFVER VQNLVRSDLA ENGLVLEAVS
ITTLKSARQG SFGTDDVFGA QVARANAEVI QQALKQRNEI DQMTQTEIAK RNATAEQERN
TIERQKQLEI ARRNASTSQE QNDIERSSEL EITRRNADVD QEKLNLERNL SQARATQQRE
ILIRESEERT AAERVAYEQQ QAAELSRVEK ERTIAEAEKL KEQAVMLAEQ RKQQAIQLAE
QERQREVQRS QVLREQAVQV ADRERQVALA QEQAKLEQAE KERLAIAAER EVAEQGVVTV
QERAAAEREA QIQIINAERD AKREIINRKN EVELETFRQI KQAEADAEAL NKKATAEASA
AIKMAEARRT EAQAMSDAEI LRAEATKATV ASQGLAEAEV IKAKADAARV EAEAIRERGL
AEAEAARAKA LAEAEGQKAL AEALAAHAGV AQELELERIR MHAQVEIGVA QAKAMGEAMA
AMDFKLYGTP ETAQQILRMI GLADGVGSLI NTAPAPLKEL GNRLINRVLP ANGNGDAEKS
ASNDNGLNLT AAQPVLREAA LIASQYLSAD ELQTLTVGAA LEQVLGVASE EQQAVLHKVQ
GMLQLMPQLA DQPLSSVLML VQNS