Gene Haur_1682 details


Gene Information

Locus tag: Haur_1682
Symbol:
ID: 5733566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1956733
End bp: 1958310
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 50%
IMG OID: 641278821
Product: hypothetical protein
Protein accession: YP_001544453
Protein GI: 159898206
COG category: [S] Function unknown
COG ID: [COG5267] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
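
The Start bp, End bp, Gene Length and Protein Length fields above can be checked against one another. The sketch below assumes the coordinates are 1-based and inclusive, which matches the stated gene length, and that the final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1956733, 1958310)
print(length)                  # 1578
print(protein_length(length))  # 525
```

Both values agree with the record: 1578 bp and 525 aa.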


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAATC GACGAAATTT TTTGAAAATC AGTGGACTGG CCGCTGCCTA TAGTGTGCTC 
CCCCTCTGGC TAAGCGCCTG TGATCGGGCG GCTCCTAGCG CAATCGCCCA GCCAACCTGG
GAGCTTGAAG CTCCGACCAA CGGCGATCAT CGCAGCCGTG TGGCAATTCG CCATTTGCTC
AATCGGCTCA GCTATGGCCC ATTGCCCGGC CAAATTGAGC AAGTTCAAGC CCTCGGTTGG
GATGCCTACC TTGAACAGCA ATTAAACCCA AGCCAGCTTG ACGACTCAGC ACTTGAGCAA
CAATTGGCTC AATTTACAAC CCTCAAGCTC TCTAGTGCCC ATTTAATCGA GCACTATCCC
AAAGGTGCGA ACGGCCCACG CCTGATTATG CGCGAGCTTC AAGCCGCTAG TTTATTGCGG
GCAGCGAGCA GCCAACGCCA ACTATTCGAG CTAATGGTCG ATTTTTGGAG CAACCATTTC
AATATTTACA TTGGCAAAAA TCAGGTCAAA TGGCTCAAAA CGGCTGATGA TCGTGAAGTA
ATTCGCCAGC ATGCGCTCGG TAAATTCCGT GATTTATTGC TGGCTTCGGC CAAAAGCCCA
GCCATGTTGG TCTATCTCGA TAATGCCGAA AACGTGCGAC CTGGAGTTAA GGTTGGCAAG
AAGATGCTTG GCTTGAATGA AAATTATGCC CGCGAACTGC TCGAACTGCA TACCGTCGGG
GCTGATGCAG GCTATAGCCA AGCCGATGTC CAAGCAGTTG CCCGTGTTTT GACGGGCTGG
ACAATCACCC GAGCCAACAG CGAGCAGCCT GGACTTTTCC AATTTCTGCC CAAATTTCAC
GATGTTATGG CCAAACGAAT CGATTTTCTG CAGCTAGATT TGGCTGCCGA TGGCGAAATC
GAAGAGGGCG AATTATTGTT AAAGCTGTTG GCTGAACACC CTAAAACTGC CCAACGATTG
GCCTATAAGC TCTGCCTACG CTTCGTCAGC GATGATCCAC CAGCTGATTT AGTTGAGCGG
GTTGCTCAAG CGTATCTTCA GCACGATACC GATATTCGCG CCATGCTCAA CATGTTAGTC
AACTCCGCTG AGTTTTTGGC CGCTGCTCAG CAAAAAATCA AACAGCCCAT GCATCTGTTA
ATTTCAGCCA TTCGCGCTAC CAATAGCAGC ATCACCAAAC AAGCCTTCAA GGGTAAAAAC
AACCTCCTTG ATCAATTGGA AACCTTAGGC CAAATGTTTT TCAACTGGCC TCCACCCGAT
GGCTATCCCC AAATCAGCAG CGCTTGGATC AACACTGGCG CGATGCTCAG TCGTTGGAAT
CTGGCCTTTG CACTCGCTGA AGGTCGAATC GACGGCCTAA AAACCGATGT CCCTAAATTC
GCCAAACAGC CAAGCCAAGC CAGCGAATTG GTCGATACGC TAGCCGATTA TCTCAATTTA
AGCCTTGCCG CCGAGTCGCG AGCCAGTTTA ATCGATTATT TAAATGATAG CCAATCACCA
AACCCAACCG TTGATCAAAC TAAAATCGCT GGCCTACTTG GCCTATTGCT GACCAGCCCT
GAATTTCAAT TGTGCTGA
 
Protein sequence
MLNRRNFLKI SGLAAAYSVL PLWLSACDRA APSAIAQPTW ELEAPTNGDH RSRVAIRHLL 
NRLSYGPLPG QIEQVQALGW DAYLEQQLNP SQLDDSALEQ QLAQFTTLKL SSAHLIEHYP
KGANGPRLIM RELQAASLLR AASSQRQLFE LMVDFWSNHF NIYIGKNQVK WLKTADDREV
IRQHALGKFR DLLLASAKSP AMLVYLDNAE NVRPGVKVGK KMLGLNENYA RELLELHTVG
ADAGYSQADV QAVARVLTGW TITRANSEQP GLFQFLPKFH DVMAKRIDFL QLDLAADGEI
EEGELLLKLL AEHPKTAQRL AYKLCLRFVS DDPPADLVER VAQAYLQHDT DIRAMLNMLV
NSAEFLAAAQ QKIKQPMHLL ISAIRATNSS ITKQAFKGKN NLLDQLETLG QMFFNWPPPD
GYPQISSAWI NTGAMLSRWN LAFALAEGRI DGLKTDVPKF AKQPSQASEL VDTLADYLNL
SLAAESRASL IDYLNDSQSP NPTVDQTKIA GLLGLLLTSP EFQLC
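
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The `translate` helper is a hypothetical name, and only the first and last lines of the gene sequence are used as input here.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGCTGAATC GACGAAATTT TTTGAAAATC"))  # MLNRRNFLKI
# Last 18 bases of the gene sequence, ending in the TGA stop codon
print(translate("GAATTTCAAT TGTGCTGA"))               # EFQLC
```

The two outputs match the first ten and last five residues of the protein sequence listed above.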