Gene Haur_3457 details


Gene Information

Locus tag: Haur_3457
Symbol:
ID: 5735318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4346603
End bp: 4348459
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 49%
IMG OID: 641280604
Product: hypothetical protein
Protein accession: YP_001546221
Protein GI: 159899974
COG category:
COG ID:
TIGRFAM ID:
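
The coordinate and length fields above are consistent with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions match the listed values but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(4346603, 4348459)
print(length)                     # 1857
print(protein_length_aa(length))  # 618
```

Both results agree with the Gene Length (1857 bp) and Protein Length (618 aa) fields.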


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACAAT TATTTGGATG TCTTTGCATC CTGATCGTGG TGCTGAGTTT CAGTTTTTCC 
CCGCGCCAGC CAGCCCAAGC GGCTCCGATG ATCGAGTATC GTGATCCTGC TCCCTTTTCT
AAAGCGGTAA AACCCACCAA CACGATCGCG ATTCGCTTAG GGCCAAGCCT GACCAAAGCC
GCCGTTGCAA CCGTAGAATT TCAGGTTGTT GGCTCACGCA GTGGTTTGCA TGCAGGCGAC
GTTGTGCTAG CCGAGGATCA ACGCACGGTT ATTTTTAAGC CAGCCAGCCC ATTTGTGTTT
GGCGAAACCG TGCGCGTATC GATTAAAAGC AGCGATATTG CCGAGCTTGA TCAGCAAACA
TGGTCGTTTG AAGTAGTTGA GCGCTTGGTT AAACAAACTG ATCAGGTCAA GCAACTGCAA
ACTGAGCTTG CCGCCGAATT GGCTACTCAA GCTAAAGCAG CACAACCAAG TGGTAGCAGC
CCGGTCTTGC GCACTGTGCC CTTTAATCTG CCGCCGTTGA CGGTTACGCT AGCGATTAGC
AATACCCCTG GCTATATTTT TGTTAGTCCA TTTAGCTGGA TCAGCAATGT CACGCCCAAT
CGCTATTTAA TGATGGTTGA TAACACTGGT GCGCCAATCT ACTACAAGGG GCTAGGTAGT
GGCCGCTTTT CACTCGATTT TCGCAAAATT GCCGAGGATA AACTGGTCTA TTTCGATACC
AGTACCCTCA GCTACCACGT GATGAATCAG CAATACCAAG AGCTTGGCCA ATATCGGGCT
GGCAATGGCT ATCAGATCGA TTTTCATGAA TTTTTGATGC TGCCAAATGG CCATGTCATT
TTTATGATCT ACGATGATAT TCCCTATGAT TTAAGCCCTT ACGGCGGCGA AGAGAATGCC
ATCCTGACCG AATTGGTGTT GCAAGAGCTG GATACTGCTG GTAACGTCGT CTTCCAATGG
CGCTCGACTG AGCATATTCC AGTTTATGAT AGTAGCCATA GTTTGGCTGG AACGGCTCCG
GTCGATTATA TTCATGGCAA TGCGATTGAT GTTGATACCG ATGGTCATTG GCTGGTTTCA
AGCCGCCATA CCGACGAAAT TACCAAGATT AATCGCCAAA CGGGCGCGGT TATTTGGCGC
TTAGGTGGCG AGGGCAATCA ATTTCTCTAT TTGGAAGATA GCCCGCGATT CTACCATCAG
CATGATATTC GGCGCTTGGC CAATGGCAAT ATTATGTTGT ATAACAATTG GAACACCTTG
CCCCGCTCGC CGGATTCGTT CTCGGCGGCG CTGGAATATG AGATCGATGA AGTTGCGAAA
ACTGTGCGTT TGGTTAAGCG CTATCGGGCA ACTCCCGACT ATTTTGCCAC AGCGATGGGC
AATGCGCAAC GCCTGCCCAA CGGCAATACT GGCATCGGCT GGGGCAGCAT TCAGCCCTTG
TATACCGAAT TCAACTCTCA AGGCCAAGCA GTTTTTGAAT TAACTGCGGC GGCACCGATG
GTGAGTTATC GCTCGATGCG CTTTGAATGG CAAGGTGACC CACCATGGCC GCCAACTTTA
GTGACCCAAA GCCTTGCTAA CACCACCAAC TTATACTATA GCTGGAATGG TGCGACCGAA
GTTGCCGATT ATCAGGTGTT TACTGGGGTT ACTAGCACAA CCTTGAGTTT GCAAAATACC
ACACCCAAAA CTAGCTTTGA AACCAATACG ACGGTGGTTA ATAGTGACCA TTGTTTTGCC
CAAGTCCGTG CCCGTAATAG CCAAGGCACA GTTTTAGGTT CCTCGGAAAT TGCCTTCTTG
GCCAGCGATA CCTGTACTCC TAACCGCATG TATTTGCCAG CCCTAACCAC CCAATAA
 
Protein sequence
MRQLFGCLCI LIVVLSFSFS PRQPAQAAPM IEYRDPAPFS KAVKPTNTIA IRLGPSLTKA 
AVATVEFQVV GSRSGLHAGD VVLAEDQRTV IFKPASPFVF GETVRVSIKS SDIAELDQQT
WSFEVVERLV KQTDQVKQLQ TELAAELATQ AKAAQPSGSS PVLRTVPFNL PPLTVTLAIS
NTPGYIFVSP FSWISNVTPN RYLMMVDNTG APIYYKGLGS GRFSLDFRKI AEDKLVYFDT
STLSYHVMNQ QYQELGQYRA GNGYQIDFHE FLMLPNGHVI FMIYDDIPYD LSPYGGEENA
ILTELVLQEL DTAGNVVFQW RSTEHIPVYD SSHSLAGTAP VDYIHGNAID VDTDGHWLVS
SRHTDEITKI NRQTGAVIWR LGGEGNQFLY LEDSPRFYHQ HDIRRLANGN IMLYNNWNTL
PRSPDSFSAA LEYEIDEVAK TVRLVKRYRA TPDYFATAMG NAQRLPNGNT GIGWGSIQPL
YTEFNSQGQA VFELTAAAPM VSYRSMRFEW QGDPPWPPTL VTQSLANTTN LYYSWNGATE
VADYQVFTGV TSTTLSLQNT TPKTSFETNT TVVNSDHCFA QVRARNSQGT VLGSSEIAFL
ASDTCTPNRM YLPALTTQ
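
The gene and protein sequences can be cross-checked against each other, and against the GC content field, with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard amino-acid assignments (which translation table 11 shares with the standard code), strips the spacing used in the listing, and stops at the first in-frame stop codon. It does not handle the alternative start codons that table 11 permits, which is harmless here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence listed above.
first_row = "ATGCGACAAT TATTTGGATG TCTTTGCATC CTGATCGTGG TGCTGAGTTT CAGTTTTTCC"
print(translate(first_row))  # MRQLFGCLCILIVVLSFSFS
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.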