Gene Haur_5106 details


Gene Information

Locus tag: Haur_5106
Symbol:
ID: 5737064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 139311
End bp: 141884
Gene Length: 2574 bp
Protein Length: 857 aa
Translation table: 11
GC content: 67%
IMG OID: 641282271
Product: hypothetical protein
Protein accession: YP_001547862
Protein GI: 159901616
COG category:
COG ID:
TIGRFAM ID:
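
The start, end, gene length, and protein length listed above are mutually consistent, which the following Python sketch checks. It assumes 1-based, inclusive coordinates and a single trailing stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length: int) -> int:
    """Amino acids encoded by a CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(139311, 141884)
print(length, expected_protein_length(length))  # 2574 857
```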


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.141867
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGTTT ATAACTATGT GCTTGTTTTG ATGCTGCTCG TGGCGGTGCC GCGTGCCGCG 
CTGGCAGTGG CGACACCGCT CACCGTGACC ACGGCAGACC AAACCGTGGT GTTGGCGGGT
GTGCCGTCCC AACCGACCAC CGTGGCCGTG CGCGGCCTGC ACGCTGCCGC GCCATCCCTG
ACGATCACCA TGGACACAAC CACGCTGCCT GAGGCCGATC CGGCACGATG GGCCGACCTA
CCCACCAGTC CCGTGACCCT GCTGCGCAGC GGGCGCTATC GCGGCTATGG CGTATGGGTC
TATCTGGTGC AGCCGATGAT CCAGCAACGC GGCCAGATCC AGCAGGTGAC GCAGCTGCAC
GCCGTGCTCG ATGGGGCGGT GCTGGTGGCA TCCCCTGCCG ATCTCCAAGC GTTGCCGCGT
GTGCCGTTTG GTGATCCCGT GCCGCCGACC AACCCATTGG CTTTGGCCGA TCACACCTGG
ACGCTGACCG TGACCGAGCC GGGCATGCAG CGGGTCACGG GGGCGATGCT GGCCGCAGCG
GGCATCGACC TCACGACCCT CACTCCGGCG ACGGTGCAAG TGCAGCACCA TGGCGTAGTG
CTCCCGCTCG ATTGGCGTGG GGTTGGCGAT GGCGTGGTCG ATGCGCAGGA CGAAGTGCGC
TTGTGGGTGG ATAGCGTCGG CGACCGCTGG AACCGTGCTT CGACCCTGTG GCTCACGACC
GCTCCCGCCA CCCCGTCGCC GCCCATGGCC TCGCGCCTTG CCCTCGCCAG CACCGCGCCG
TTGACTGATA CGGTCTGGAT GACGCAAACG TGGGATGATC CGCAGATCCT CGACAGTCGC
CATGCGGGCA TGCGCGGCTG GCACACCTTC AGCACCCGCC TGAGCAGCCT CGCGGGTGGC
GATGCGCAGA CAATGACGAT CCCAGTGACC GCAACGTTGC CATTGGCCAG CGGCCTGATG
ACCGTTACCC TGCGCGGCGC GACGGCGACC AACCTACCCG TGCCGCTCGT GGTGAATACC
GTGCCGCTGA CCGTGCCTGC GACTGCGGCA TGGCAAACGA CGCTGGCCGT CTCCACGAGC
GCGGCCATCA CGGTGACGCT GCCTGCTCCG GCCATCGGCG CGGCCAGTGT GCTGCTCGAA
ACCATCACCG TGACCCGTCC GACGCGGCTC GCGACCATGC CGACTGACCC ATGGGAGAGT
GGGTCAACGC CTGCGCGGTA TGCCCTGCCG GACGCGCCGC CGCTGCGCAC GCTCTATGAT
GTGACCGAGG TACAGTCGCC GCAGATCGTG ATCCTGCCTG CCGGACCCAC GCCCGTGCTG
GCTGATCCGT TGGTCAATCG GCGGTATCTG CTGGTTGGTG CGTCGCCACT GCCCACGCCA
ACGCTTACCC GCCATACCCC TATCGTGCTG CTCACCGTCG GATCGGACGT GATCATCGCC
CCACGCGCCT TCCTGCCCGC CCTCGACCCG CTGCGTACAC CGACCACGGT GCTGGTGGCG
CGGGAGGATC TCGATGCGGC ATGGGCGTTT GGCCACGTAT CCCCGATGGC GATTCGCACC
TTTCTCCAGC ATGCAGCGGC GACGTGGCCA AGCCCGCCCA CCAGTGTGCT GCTGGTCGGC
GATGGCACGA CTGACCCGCG TGATGTGCTT GGCTATGGTC AGCCACCGCT GATCCCGCCT
TATCTGGCCG AGGTCGATTT ATGGCTGGGC GAAACCGCCT GTGAAGCCTG TTATGGCCAA
TTGGACGGCG CTGATCCACT CAGTGACCTG CTACCCGATC TGCCCGTGGG CCGCTGGCCC
GCCACAACGG TGGAGGACGT GACCGCCCTG ATCGCCAAAC AGCAGCGCTA TGCGGCGGCC
CCGTGGGGGG CGTGGCAAAG CACAGTCGGG AGTCTTGCTG ATAATGCCGA AGGAGCGCTC
GACTTTCCGC AACTGGCGGC GCAGAGTGAG GCCGTCTATC CCCTGACGAT GACCTTGCAT
CGCGCCTATT ACGCCCCACA GGCCACGAGC ATTGCCCCCG CGTGGCATGA AGCGGATGCT
CGTGCCGTGC GCGAGCGCGT GCTGGCGATC TGGCAGGCGG GGGCGATGCT GATGCAGTAC
ACGGGCCATA GCCACGCCTA CCAATGGGCC GTGACTGATC CCGTGGTCGA GCCGCGTGGG
TTGCTCGATC TGAATGCGGT GGGCGACCTG CACAATGGCG AACGCCTGCC GCTGCTGCTG
GCCTTGACCT GCCTGACCAG TGCCTTTCAT CAGCCCAGCC CCCGTGGAAC CACGCTCGAT
GAGGCGCTGG TGCTGCATCC TGACGGCGGG GCGCTGGCAA CCTGGGGATC AAGTGGTTTG
GGGGTCGCCC ATGGCCATGA TCACCTTCAG CATGGGCTGG TAACGGCGGC GCTGACGATG
CCTCGGCCAA CCTTGGGGCA GGTGACGGAG GCGGGAGTGC TTGAACTGGC GCTCACGGGG
CACTGTTGTA CCGATGCGCT GCGCACCACC CTGCTCTTGG GGAATCCGGC CACGGTACTG
CGGGTTGCGC CTACGCCGCA GCAGGTGTGG CTGCCATTGG TCGGGTGGGA ATAA
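
The gene sequence is printed in blocks of ten bases, so it has to be joined before any analysis. The Python sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical. For brevity it runs on only the first and last lines of the listing, so the GC value it prints describes that excerpt, not the whole gene.

```python
def clean_blocks(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listing, used as a short excerpt.
first_line = "ATGCGGGTTT ATAACTATGT GCTTGTTTTG ATGCTGCTCG TGGCGGTGCC GCGTGCCGCG"
last_line = "CGGGTTGCGC CTACGCCGCA GCAGGTGTGG CTGCCATTGG TCGGGTGGGA ATAA"

excerpt = clean_blocks(first_line + "\n" + last_line)
print(len(excerpt))                    # 114 (60 + 54 bases)
print(excerpt[:3], excerpt[-3:])       # ATG TAA (start and stop codons)
print(round(gc_fraction(excerpt), 2))  # GC fraction of the excerpt only
```

Passing the full listing to `clean_blocks` in the same way should give a string whose length matches the gene length stated in the record.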
 
Protein sequence
MRVYNYVLVL MLLVAVPRAA LAVATPLTVT TADQTVVLAG VPSQPTTVAV RGLHAAAPSL 
TITMDTTTLP EADPARWADL PTSPVTLLRS GRYRGYGVWV YLVQPMIQQR GQIQQVTQLH
AVLDGAVLVA SPADLQALPR VPFGDPVPPT NPLALADHTW TLTVTEPGMQ RVTGAMLAAA
GIDLTTLTPA TVQVQHHGVV LPLDWRGVGD GVVDAQDEVR LWVDSVGDRW NRASTLWLTT
APATPSPPMA SRLALASTAP LTDTVWMTQT WDDPQILDSR HAGMRGWHTF STRLSSLAGG
DAQTMTIPVT ATLPLASGLM TVTLRGATAT NLPVPLVVNT VPLTVPATAA WQTTLAVSTS
AAITVTLPAP AIGAASVLLE TITVTRPTRL ATMPTDPWES GSTPARYALP DAPPLRTLYD
VTEVQSPQIV ILPAGPTPVL ADPLVNRRYL LVGASPLPTP TLTRHTPIVL LTVGSDVIIA
PRAFLPALDP LRTPTTVLVA REDLDAAWAF GHVSPMAIRT FLQHAAATWP SPPTSVLLVG
DGTTDPRDVL GYGQPPLIPP YLAEVDLWLG ETACEACYGQ LDGADPLSDL LPDLPVGRWP
ATTVEDVTAL IAKQQRYAAA PWGAWQSTVG SLADNAEGAL DFPQLAAQSE AVYPLTMTLH
RAYYAPQATS IAPAWHEADA RAVRERVLAI WQAGAMLMQY TGHSHAYQWA VTDPVVEPRG
LLDLNAVGDL HNGERLPLLL ALTCLTSAFH QPSPRGTTLD EALVLHPDGG ALATWGSSGL
GVAHGHDHLQ HGLVTAALTM PRPTLGQVTE AGVLELALTG HCCTDALRTT LLLGNPATVL
RVAPTPQQVW LPLVGWE