Gene Haur_5183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5183 
Symbol 
ID5737141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp263116 
End bp265530 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content54% 
IMG OID641282347 
ProductXRE family transcriptional regulator 
Protein accessionYP_001547938 
Protein GI159901692 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.621684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGACGT TTGGACGATG GCTCAAACAC CAAAGGATCA GCCACGATCT CACGCAAGAA 
GCGCTGGCTG AGCGTATTGG CTGTTCGGTC TCGCTCCTAA AAAAGCTTGA GACGCATCAG
CGCCGTCCCT CAAAACAGAT CATCACCCGT CTAGCAACCA TCTTCAAGGT GGACGCAGCG
GTTGTGATCA AATGGCGAAC AACCGATCCC ACTCCCGAAG TAAACCCTGT TGGTGCGTAT
CTACCGATAC CACTGACTCC CCTCGTAGGT CGTGCCACCG ATGTTGCGGC ACTCCAGATA
CTCTTGAGGC AGGACACCGT GCGCTTCCTA ACACTGACCG GACCAGGGGG AGTAGGAAAA
ACACGGCTTA GTCTGGCATT AGGTGCAGCT TGCCAGCTAT GGTTCCCTGA TGGCGTATGG
TATGTACCAC TTGCTCCTGT ACACGATGCC GACGGCGTGT TGCCAGCCAT CATGCACGTG
CTGCGACTTC CGACCCCACC CGACCAGACC GTGCTCGAGA CGATCCAAAA CATGTTGAGA
CACCGCCACG CGTTACTGAT TCTGGATAAT TTTGAGCATG TACTCGCTGC TGCCCCGATG
ATTGCAACAC TCCTTGAAAG CACAACCCAG TTAAAAATCG TCACAACGAG CCGGGCCCTA
TTGCATATCC CTGGGGAACA CTGTATGGTC GTGCATCCCT TAAGCCTCGT CGTTGAACCA
ACGGGTAAGG GGGTTCATGC GGCGCATTCG GCAGCGATGG AGCTTTTTCT TCAACGAGCC
ATCGCAGTGA ATCCCCAGAT CGTCATCACT GACGAAGCCT TAGCCGCAAT CCGCACCATC
TGTGTAGAGC TGGAAGGACT GCCGTTAGCC ATCGAGCTTG CTGCCGCTCG CTGTCAGATA
GTCACGCCAC AGGAGCTTCG TACCGTGTTT CGGAGCCGCC TGATGCTAGC CAAAAGTACC
AACTATACAC GGCCGCAGCG CCATAGTAGT GTGTGGGATG CGCTCCTATG GAGCTATCAA
TTACTTGATC CTATGGCCCA ATCTATTTTT CGTACGCTGG GTGCCTGCGT TGATGGAGTT
CTGCTCCCCA CGCTCCAAGC GATGTTCGCG AACGAGGATG AAGCCGCTCC ATCGCTGATG
GACTATCTTC ATCTCCTCGC GAACCACAGC CTCATCAGCC TCGAACCAAC GACGACGGGT
GTTCTGGCAC TAACGATGTT GGAAACGTTG CGAGAGTTTG CTCACCTGCT GCTCGTTCGT
CATGGTGAGG AATTGTATGC ACGCCGCCGC CATGCTGACC ACTATCGGGC ATTTCTAGTA
GATATTAATA TCAGGCTCGA GAAGGCTGAT AGTGTCTCGT GGGTCGCTCA ATTTGACCAT
GAGGCCGCAA ATATCGATCA CGCCTTACAC TGGCTAATGA CCCACGATCC TGCCGTTGCA
GCTGAGTTCG CGAGCCACCT TGCTCCGGTT TGGATGAATC GGGGATCGAT CCAGTATGGC
ACTATTTGGC TAGAACGGTG TATCACCCTA CCAAGGCTTG ACCCGCTTTT AGAGGCCATG
CTTGCGCGTG ACTTGGGAGC CTTTTGGATT ACGGCGGGTC GTTACGCAGA TGCCGAGGCA
GTACTCATTC CCGCCTTGGC CATGTTTAAA TCTGCGCGTC GAGAGCGGGA TCAAGTGCGT
ATTTATTTTA TGCTCGCATA TGCTGCCATA CAGCAAAATC GCTTTGTCAT CGCCGAGCAG
TATCTTGCGC GCTGTGAGGA TTGGGCCCAT AGCAATGCGG AAATAGAACG ACTCACGATT
ATTTTCCATA ACCAGGCTCA GATGTTTGAA CAGCAGGGGA ACTATGAGAC CGCTCGCATA
AAAGTGCAAG CCATGCTCAA CCTTTGTCAA ATCCTAAAGT ATCCCGCCAA TAGCGCCTTG
GCTTGGACTC GCCAAGGATT CATCGATCTG GCGCAGGGTA GCGTGGATGC AGCAGAACAT
GCCTTGTTCC AAGCCCGCAG GATGCTTGAA CAGACAATTA GTTTTGATCG TGGTGACCTC
GTCGATATGC GTCGGCTGGA CGGGTTACTT GCGCTTGCCC ATGGCGATCA TCGGCGTGCA
CGGGTAATCC TAGGCCAGAC ACTCGCCATG GCTGCTGAAT TGAACGATAT CCCGCGTGTG
GTTCAGACGC TTGATGCCTG TTTATGGTAT ACCTATCGTA CGAAAGACCT GCACATCTCC
GCAGCCCTGC TTGGGTTGCA ACAGCGTTTG CGGAGACACT ATGCCATGCC AGCACCACCA
CCCGTCCAGT CACAACTGGA TTCACTTGCA ACCTCTCTTA CGAATGGGCT GGAGCCGGAC
GTGCTAGGTC ACTGGATGCA ACACGGTGCG ACATGGGTGA TCGCTGATGC CTGTCGGCAG
CTTCTGGCGG ATTAA
 
Protein sequence
METFGRWLKH QRISHDLTQE ALAERIGCSV SLLKKLETHQ RRPSKQIITR LATIFKVDAA 
VVIKWRTTDP TPEVNPVGAY LPIPLTPLVG RATDVAALQI LLRQDTVRFL TLTGPGGVGK
TRLSLALGAA CQLWFPDGVW YVPLAPVHDA DGVLPAIMHV LRLPTPPDQT VLETIQNMLR
HRHALLILDN FEHVLAAAPM IATLLESTTQ LKIVTTSRAL LHIPGEHCMV VHPLSLVVEP
TGKGVHAAHS AAMELFLQRA IAVNPQIVIT DEALAAIRTI CVELEGLPLA IELAAARCQI
VTPQELRTVF RSRLMLAKST NYTRPQRHSS VWDALLWSYQ LLDPMAQSIF RTLGACVDGV
LLPTLQAMFA NEDEAAPSLM DYLHLLANHS LISLEPTTTG VLALTMLETL REFAHLLLVR
HGEELYARRR HADHYRAFLV DINIRLEKAD SVSWVAQFDH EAANIDHALH WLMTHDPAVA
AEFASHLAPV WMNRGSIQYG TIWLERCITL PRLDPLLEAM LARDLGAFWI TAGRYADAEA
VLIPALAMFK SARRERDQVR IYFMLAYAAI QQNRFVIAEQ YLARCEDWAH SNAEIERLTI
IFHNQAQMFE QQGNYETARI KVQAMLNLCQ ILKYPANSAL AWTRQGFIDL AQGSVDAAEH
ALFQARRMLE QTISFDRGDL VDMRRLDGLL ALAHGDHRRA RVILGQTLAM AAELNDIPRV
VQTLDACLWY TYRTKDLHIS AALLGLQQRL RRHYAMPAPP PVQSQLDSLA TSLTNGLEPD
VLGHWMQHGA TWVIADACRQ LLAD