Gene Haur_5003 details

Gene Information

Locus tag: Haur_5003
Symbol:
ID: 5736962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 4800
End bp: 9140
Gene Length: 4341 bp
Protein Length: 1446 aa
Translation table: 11
GC content: 60%
IMG OID: 641282170
Product: hypothetical protein
Protein accession: YP_001547761
Protein GI: 159901515
COG category:
COG ID:
TIGRFAM ID:
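
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record above.
print(gene_length(4800, 9140))   # 4341
print(protein_length(4341))      # 1446
```

Both results agree with the Gene Length and Protein Length fields listed in the record.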


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCTCG ACACCTTGCA GCAGCATCTG CTGTCGGCCT ATGACTCCAC CGCCCGCAGC 
TTGCGCTTGG TTTCCGCAAC GCTTGGTTCG GCCCCGATCG CGGAACTGTT AGGCCAACCC
TATCTCCAAA TCCAGACACT GGAGATTACA GACACGGATG GCCCTCTCAC CGATGGTAGC
GCTGTCGTGG TGAGCGGCCT GAGCACCCTG TTTGGTTTCA CCTTGGTCCA GGTCGCAGCC
AGCTTCACGC TTGACGACCA AGCAATACCC CAGCTCGCCC TCAAGCTGCT GCTACCAGAC
GGCACATCGG CTACGGCTTG GCGCTTCGCT GATAGCTTCC CCCAGTTGGC GGGCACAATC
TTCGATGATG TCACCCTTTC TCCTGGAACG ATGCCGTTCC TGCTCTTCGC CTCGGCACCC
TGCGACGATT CCGGTACTAG CTTCGCTGCT GGTCTGAGCT TTTATGGCGC ACTCACGCCG
GGCGGCCCCA GTTTCGGCTA TGTCACTGCG GTGCTGGGGA CACTGGCCAG CGAGATCGCA
ACTGGCCCGA TCATCCCGCT GCCCAACGGT CCAACTATTA ACCTTGCGCT AGACTTGGCA
TCGAGCTTCA ACAAATATTT TCCCAGTTTG AGCCAAGAAT TACCGGTGGA ACTGCGGCTG
CTGAGCGCCT TTGACCAGAG CATGCCGCCG ACCGCACCGC GCTCTATGGT CGGGCTTGAG
CTTAGCACGA CCTTCGATCT TGGTACTGGC TCGATCCGAC TGGCCACCAT GCTGACGGGT
ACCCGGATCG GTGTGCTCAA ATTGCGTGCC AGCATCGACA ATTTGCCATT GCCAGCACCT
GGCCAACTGG TGGCGCTGCT GGGCGGTGAC GAGTTGGTTG GCTCGTTACC TGAGCGCTAC
CAGAACCCCG ACACGGTGCG GCTCAGCGGC TTCGGCTTTG GTATCAGCTT GGCTACGCTC
CAGCCGGTCA ATCTCTGGCT CCAGCTCGCC GCCCTCGAAG GGGAGGGCTG GGAGATCATC
CCGGGCGACC TGACGCTGAA ACGGGTGCGA GCATGGTTTA CCGTCAATAA CCCGCTCGAA
AGTACGCGCA GCGCCCAAAC CGCCATCTTC GCCGAGCTCG ACGTACCCAG TGCTAATCCA
CTCTTTTCGA TGGAGGTGCA TGCGTATGCG CCCAACTACC GCATTCAGGC CGCGCTGGTC
GAAGGCACGA CAGTTAAGCT AACCGACCTG CTCGCCGTGT ATCTGCCGCA GGTCACAGAC
GCGCCGGAGT TTCTGCTGGA GGAGCTTGGG CTGGCGGTTG AGTTCGCTAG TCCAAAGAAC
CGTCTGACCT TCGAGACGAC GATCGAGCAA GACACCCCGT GGATGCTGCC GTTGGGCGGA
CTGGAGCCAC TCCAAGTTCA GTTCATCACT ATTGCCTTGG ATAACTTTAG CAACGGCGAC
AGCATGGGCG GCCTGATCGA CGGCCAGCTT ACTATCTTGG GAGCCCAGAC CAGTTGCTTC
TACCAGCTTC CTGGCTCGTT TCGCATCAGC GCCCACATCC CAGCCTTTGA CGTTAACCTG
AAACGCATCG CCAGCGAATT GGCTGGCAGC GATTGGGTGC CGCCTAGCTG GCTGCCAGAT
TTTACCCTGC CGCAGACGTA CCTCGCGGTC GAGCGCGATC GCGATGGCGA GCAGTCGATC
TTTACGCTGC TGCTCCAGGC TGAGCCAGCG GGACTCGGCA CACTGGGCTT ACAGGTGCTA
CGCGAGCGGG GCAGCTGGGG CTTCGCCGCT GGCGTAGACC TGATCGCCGA CCGCGTTTCA
GATGTGCCTG GCTTGGCAGC GTTCAAGCCG TTCGACGATC TGTTCCAGTT CAGCAACCTA
CTCTTGGTTA TCTCAAGCAT CGCGAGCCCA GCCTTTACCT TCCCCGACAT GAGCCACCTC
GGGTCCAGCG GCAGTGGGGG TCGTCAGATC GTGCTGCCCA GCCAAGCCGG CGGGCTGCGG
TCTGGCCTCA CGCTCTATGC AGATATCACA CTGAGCAACT CTCCGACACT GACGATGCTC
CAGACCTTTC TTAAGATCTC GGGCGACATC GGTATCACCA TGCTCTTGGG TGAGAATCCG
ATGCAGAATG CACGGCTGTT CGCCAGCGTT GACGTTCGCG TGCTCACCAC CCTGCGCATC
GTGGCTGAGT TCGGTGCCAC ACTCCAAGGC AGTGAAACTG GGCTATTTCT CCAAGGCCAG
GCACTGGTCG TGCTCGCTGA TCAGCCGCTG GAGTTTGACA TGGCGATGCT CCTCGTCGAT
GACGGTGCAC TGATCGCCGG GAACCTCAAG GGCGGCCCGG TGAACTATCA CAGCCTACAA
ATCAGCAATC TCGCGCTGGA GCTGGGCGTG GATTTCGAGG GCGTACCCAG CGTTGGCTTT
GCGGCGACGA TCGACACGCC GACCTTCGAA AGTTCGCTGG CGATCTTCTT CGACTCAACC
GACCCGGCCA ATAGCATGCT CGCCGGCGCG GTCAGCGACC TCACGCTGCG CGACGCGGTG
GAGGTGCTTG TTGGTGGAGT GCTACCAGTG TCGCTTGGCG AGGCACTAGC CCAAGTCGGA
CTCACGGGAA CAGGGCAATT TCTCCTGTCG GGCGACTTGT CCACAGCCTT GGACAGCTAC
GACATGACGG CGGTCGCGGC GGCCCTCCAA GCGGTCGGCG TGACTTTTCC CCAGCAGGGT
CAATCCGCGC TGTTGGTCGT GAACACGCCC GGCCATGTCT GGCACCTGAC CGATCTGACG
ACCATGCGTC ACTTTGAGTT TGAGCGCCAG GGCAACCAGA TCGTGGGTGC GTTGGAGGCA
CAACTCTACT GGGTACCAGG TTCACGTGGT GCCTACATTG GTCAGACCTA CTTTCCGAGC
GGCTTTCTGC TGAGTGGTAC GATCGTCTTC TTTGGCCTAG AAGTCAGCCT GTCGTTCACA
ATTAGCCGAG GGCAGGGCAT CAGTGCTGAG GCACAGCTAA GCCCGATCGC GATTGGCGGA
TCGCTGCTCA CAATCACCAG TCTAGGCGGC AAGGCTGGCC CCCATCTCTC ACTTTCGACG
GCGGCTCCAG AGCACTTTTT CCTGAGTGGC GACATTCAGT TGCTCGGTAT CATTGGTGTC
GGCGTACAGA TCAGCATTGC GCAAGCCGGT CTGCTCTTCG ACCTGCGCGG TGAGCTGCTC
CCGGGGATGA CCTTTGACCT GCACGCGAGT TTCACCAGCC TGCACAGTTT CAGTGCAAGC
GGATCGGTGG TGCTGGGGGT CAGTCGCATG CTCGACTTCG GGCCGCTGGG GTCACTCAGC
CTCAATGTCG GTATCAACGG ACAGCTAACG ATCGCCCACC AAGATGGTGT GAGCAGTGCT
ACCTTCCAAG GTGGATTCCA GTTCGGCACG AACGCGTACA GGGTTGGCCC GATTACACTC
GACATCAGCC GTGCGGCCCT CGCCGAACTG GGTGAGACCG TGGCACACCA TCTCGAGGCA
GTTTTCCATC ACGTGATGCA CGACCTGCGA GGATGGTTCG AGTTGGTCAA ACAGAACCTA
ATCCAAGGCA TCAGCGGGAT GGGCCAGATT ATCCACGTGT TGCGCCAGCA CTTCGGCCAA
GACAAGCACG CCGTCGCGAC GCTGCTTAAG GAGTTGCTCG TCCAAGGTGT TGAGCCAATC
GCCGATGCTC TGCGCAGCAC CTTCGAACTG GATAGCCGTG GGCTGGCACG CTTGCTGCAT
GACACCGGCT ACGCGGTCGA GGATGTGACG CGGGCGCTGC GTACGGAATT CGACAAAAGC
CACCGTGAGG CAGCGGCGAT CCTCAAGGAG GTTGGTTACT CGGCGGATGC GGTGGCACGG
GCGCTCCTCC AGAAGTTCGA GAACGATCGA ACGCGGGCGA TCCAAGTGCT GCGCGAGGTA
GGCTACGATG CCCGCGAGGT GACGGGTGTC ACCGTGGGGG TCTTCCAGCA GTCGGCGGCG
GACACGGTGG CACTGCTCAA AGCGGCCGGC TACGAAGTCG AAGAGGCCGC CCGTGGCTTA
CATCAGGCCT TGGCCTTGGA TAGCCGGGCT GTGGTGACGC TGCTCGGACA GTCAGGCTAT
GACGTGAAGG CGGTCGGACG CGCGTTACAG CATACCTTCG ACAAAAATGC CAACGAGGCA
TCGCGGATTC TCAAGGAACT GAATTACCCG ATCCACGATC TGGCCAAGGT TTTGCGCCAC
GTCTATGATA AAGTAGCCAA GGAGGCTGGC AACGTGCTCA AGGAACTCGG TTATGCCAGT
AAGGATGTCT CGGATTCGCT CCAAAAGGTC TTTGACTTAG GAGAAAAAGA AGTCGAGAAG
CTCATCAAGG ACATTTTCTA G
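
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before it can be measured. The sketch below, with hypothetical helper names, strips the layout whitespace and computes the GC percentage of whatever sequence it is given. It is shown on the first row only; running it on the full block would be one way to compare against the GC content field, which is not verified here.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGACTCTCG ACACCTTGCA GCAGCATCTG CTGTCGGCCT ATGACTCCAC CGCCCGCAGC"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this row only
```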
 
Protein sequence
MTLDTLQQHL LSAYDSTARS LRLVSATLGS APIAELLGQP YLQIQTLEIT DTDGPLTDGS 
AVVVSGLSTL FGFTLVQVAA SFTLDDQAIP QLALKLLLPD GTSATAWRFA DSFPQLAGTI
FDDVTLSPGT MPFLLFASAP CDDSGTSFAA GLSFYGALTP GGPSFGYVTA VLGTLASEIA
TGPIIPLPNG PTINLALDLA SSFNKYFPSL SQELPVELRL LSAFDQSMPP TAPRSMVGLE
LSTTFDLGTG SIRLATMLTG TRIGVLKLRA SIDNLPLPAP GQLVALLGGD ELVGSLPERY
QNPDTVRLSG FGFGISLATL QPVNLWLQLA ALEGEGWEII PGDLTLKRVR AWFTVNNPLE
STRSAQTAIF AELDVPSANP LFSMEVHAYA PNYRIQAALV EGTTVKLTDL LAVYLPQVTD
APEFLLEELG LAVEFASPKN RLTFETTIEQ DTPWMLPLGG LEPLQVQFIT IALDNFSNGD
SMGGLIDGQL TILGAQTSCF YQLPGSFRIS AHIPAFDVNL KRIASELAGS DWVPPSWLPD
FTLPQTYLAV ERDRDGEQSI FTLLLQAEPA GLGTLGLQVL RERGSWGFAA GVDLIADRVS
DVPGLAAFKP FDDLFQFSNL LLVISSIASP AFTFPDMSHL GSSGSGGRQI VLPSQAGGLR
SGLTLYADIT LSNSPTLTML QTFLKISGDI GITMLLGENP MQNARLFASV DVRVLTTLRI
VAEFGATLQG SETGLFLQGQ ALVVLADQPL EFDMAMLLVD DGALIAGNLK GGPVNYHSLQ
ISNLALELGV DFEGVPSVGF AATIDTPTFE SSLAIFFDST DPANSMLAGA VSDLTLRDAV
EVLVGGVLPV SLGEALAQVG LTGTGQFLLS GDLSTALDSY DMTAVAAALQ AVGVTFPQQG
QSALLVVNTP GHVWHLTDLT TMRHFEFERQ GNQIVGALEA QLYWVPGSRG AYIGQTYFPS
GFLLSGTIVF FGLEVSLSFT ISRGQGISAE AQLSPIAIGG SLLTITSLGG KAGPHLSLST
AAPEHFFLSG DIQLLGIIGV GVQISIAQAG LLFDLRGELL PGMTFDLHAS FTSLHSFSAS
GSVVLGVSRM LDFGPLGSLS LNVGINGQLT IAHQDGVSSA TFQGGFQFGT NAYRVGPITL
DISRAALAEL GETVAHHLEA VFHHVMHDLR GWFELVKQNL IQGISGMGQI IHVLRQHFGQ
DKHAVATLLK ELLVQGVEPI ADALRSTFEL DSRGLARLLH DTGYAVEDVT RALRTEFDKS
HREAAAILKE VGYSADAVAR ALLQKFENDR TRAIQVLREV GYDAREVTGV TVGVFQQSAA
DTVALLKAAG YEVEEAARGL HQALALDSRA VVTLLGQSGY DVKAVGRALQ HTFDKNANEA
SRILKELNYP IHDLAKVLRH VYDKVAKEAG NVLKELGYAS KDVSDSLQKV FDLGEKEVEK
LIKDIF
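
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the amino-acid assignments of the standard genetic code (translation table 11 uses the same assignments and differs in its permitted start codons) and assuming the sequence contains only A, C, G and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw, to_stop=True):
    """Translate a DNA sequence in frame 1, ignoring layout whitespace.

    Stops at the first stop codon when to_stop is True. Raises KeyError
    on any codon containing a character other than A, C, G or T.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGACTCTCG ACACCTTGCA GCAGCATCTG"))   # MTLDTLQQHL
print(translate("CTCATCAAGG ACATTTTCTA G"))            # LIKDIF
```

The two outputs match the first ten and last six residues of the protein sequence listed above.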