Gene Haur_5234 details

Gene Information

Locus tag: Haur_5234
Symbol:
ID: 5737192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009974
Strand:
Start bp: 1014
End bp: 2927
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 58%
IMG OID: 641282398
Product: hypothetical protein
Protein accession: YP_001547989
Protein GI: 159901744
COG category:
COG ID:
TIGRFAM ID:
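
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1014, 2927)
print(length_bp)                  # 1914
print(protein_length(length_bp))  # 637
```

Both values agree with the Gene Length and Protein Length fields of this record.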


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.167052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCAAC CACCAGCAGC CCAACCCGCG TCCAGTGGCG ACGGCGGCGG GGCAATGATG 
GGCTTTTTCA TGGGCCTGAT CGTGGTGATC GTGGCCTTGC CCGCAGCGAT TGGGGCATTG
TGGGGCAGAC GCTATATGAA AACCCACCAG CCGATGGTGG CGATGCTTGC GTTAGCAGGC
GCAGGGCTGA TGGCCTTGGT CGTGTGGTCG CCGTTGCAGG ATCATTTAAA GGAGAGCCGC
ACGGCGATGG AACGGGAAAA ACGCCGCGAT GGGATCATGG GCGTAGTCAG TGCGGGAGCC
GGAGCGATTC CGCTGTTGTG GCTCTACACC ATCCCGCTCG CGCCTGCGTT GACGATGGTG
TGGGAGAGTG TGCGGCCCAA ATCCTTAGCC GAGCAACAAG CCGACAAGGA TGCCCAAGCC
CAGGAAAAGC GCACGCTTCA GCTGGACTCG GCCAAACGAC GGGCGCTGAA AGCGAGTGCG
GCGAGCCTCA CCCATCGGCC TGTGACCAAG GAGATTGATA GCGCCACCGT CTTAGGGGCC
AAGATTCAGG GCGATCCGTT GTTCTTTGCC AATGAGCATA AGCGGTTATT ATATATCACA
ACGAGCATTG GCGCAGCCAG TCTGCATATG CTGTTTATTG GCGAGAACGG AAGCGGGAAG
ACCATTTCAA TGTTGCGTTT TGCTGCTTCG ATAGCGGCTT CCACGAATTG GGACATCTTT
TTCATCAACC CCAAAAACGA TGCCAAAACG ATGCAAGAAT TTTATGATGT GATGGCGTTT
TATGATAGGC AATGTCGCTT GTTCCCGCAA GAAGCCTATA ACGGCTGGGA GGGGGATAGC
GGGGCGTTGC TCAGTCGGAT TATGGCGATT CCGGCCTATG CAACCGAGGG CGCGGCTTCA
TTTTATGCCG ACATGAGCGA GGTCTACTTA CGAGCGGTCT TGAACACCGA TGAGGCGTTG
CCAAGCTCGT TTGAAGAACT CGAAGAACGC TTGCAATATG GCCGTTTGGC CGACCAATAT
AAAAACAACG CGCAGGGGTT TGCGCGGGTG GCCAGCATCA GCGCAGCGGA TGCCAAGAGC
GTCTTTATGC GCTTTGCGAC CATGACCCCC AAACTCACGC AAATTCGGCG GGATGGCTGG
CGATTGAGCG AGGCCCGCGC CGCCTATTTT GGCCTGCCCG TGTTGGCCAA CGAGCGCGAT
AGCCAAAGTA TTGCCAAGTT TCTCTTGGAG GACATCAAAC ACTACTTGTC CACCCGCAAG
CCCAGCGACC GCCGCACGGT GCTGATTATC GACGAATTTT CATCCCTCGG AACCGAGAGC
GTGATCCGGT TGGCGGAAAT GGCCCGCAGT TTGGGCGGGA TCGTGATGCT CGGAACCCAA
ACACTGGCGG GCCTTGGTGA TGCCGACCAA CAGGCGCGGA TTGTTGGCAA TATGACCGTG
GTGTTACACC GCATGAGTGC CCCCGAAGAA CTGACCAAGC TCGCGGGTGT TCAAAAGGTG
ATGACGACGA TTCACCAGTT TCAGGGCAAG CAAATTTTGA AGCGCGGAAC CTACCGCATG
GAGGAGGAGG CGCGGATCGA CCCACAGGAT GTCAGAACCT TGCCCACAGG CTGCGCGTGG
GTGATTGCCC GTGGGGCAGC GGCCAAAGTC CAAATCGCGA TGATGCCCGC GGTGCCGCAT
GTGCCGATTG TCATCCAGCG ACCCCGCCGA GCGCCGACCC CTGCCCAGGC ATTTGCGCCT
GTTCCGGAGG ACGCGGCCAG TTTTGGAGCC ACGGTTACCC CAATGGCGGA ACCGCCCAAC
CCCCAGCCGA TGATGGATCA CCACGACGAC CCCGCCGTGA TAGAGGAGGA AGCCCATGCC
CACGACGCTG ACGAACGCTT CAGCTTTGGT GCGGGTCGCG TTGACCCCGC GTGA
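
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, and only the first row of the sequence is used here for illustration.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, as an example input.
first_row = "ATGAGTCAAC CACCAGCAGC CCAACCCGCG TCCAGTGGCG ACGGCGGCGG GGCAATGATG"
print(round(gc_content(first_row), 1))
```

Passing the whole 1914 bp sequence instead of a single row gives the figure to compare against the reported GC content.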
 
Protein sequence
MSQPPAAQPA SSGDGGGAMM GFFMGLIVVI VALPAAIGAL WGRRYMKTHQ PMVAMLALAG 
AGLMALVVWS PLQDHLKESR TAMEREKRRD GIMGVVSAGA GAIPLLWLYT IPLAPALTMV
WESVRPKSLA EQQADKDAQA QEKRTLQLDS AKRRALKASA ASLTHRPVTK EIDSATVLGA
KIQGDPLFFA NEHKRLLYIT TSIGAASLHM LFIGENGSGK TISMLRFAAS IAASTNWDIF
FINPKNDAKT MQEFYDVMAF YDRQCRLFPQ EAYNGWEGDS GALLSRIMAI PAYATEGAAS
FYADMSEVYL RAVLNTDEAL PSSFEELEER LQYGRLADQY KNNAQGFARV ASISAADAKS
VFMRFATMTP KLTQIRRDGW RLSEARAAYF GLPVLANERD SQSIAKFLLE DIKHYLSTRK
PSDRRTVLII DEFSSLGTES VIRLAEMARS LGGIVMLGTQ TLAGLGDADQ QARIVGNMTV
VLHRMSAPEE LTKLAGVQKV MTTIHQFQGK QILKRGTYRM EEEARIDPQD VRTLPTGCAW
VIARGAAAKV QIAMMPAVPH VPIVIQRPRR APTPAQAFAP VPEDAASFGA TVTPMAEPPN
PQPMMDHHDD PAVIEEEAHA HDADERFSFG AGRVDPA
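
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal illustration, assuming the codon assignments of NCBI translation table 11, which are the same as the standard code for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each over TCAG).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGTCAAC CACCAGCAGC CCAACCCGCG"))  # MSQPPAAQPA
```

The output matches the first ten residues of the protein sequence, and translating the final bases `GTTGACCCCGCGTGA` gives `VDPA`, the last four residues before the TGA stop codon.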