Gene Haur_5124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5124 
Symbol 
ID5737082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp168633 
End bp171488 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content53% 
IMG OID641282289 
Producthypothetical protein 
Protein accessionYP_001547880 
Protein GI159901634 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATA GCACTGATGG TTCCGCTTAT GTGCATGATT CGCATATTAA TGGGGTTGTT 
GTCGGAACGA ATCTTGGTAC GATTCTCTAT GGCCGTCCAC CAGAAGAAGC CGAGCGCGAT
CTTTTAGTAC GATATTTGCA ACGAGTGACG GAGTTGCATA GCACGATGCC TGTGGTTGGG
CTTGGATCAT CACGGCTCGA TGCTGGTCTT GACCTTGCGT CGGTCTATAT GATGCTCGCG
GTGCAGGGCC GTTATCGGGC CGTGCGGACA CTCACGGCAG AAGAAACTGA AGCCTATCGC
CAACAGCGCT TTGTCATTCC CAAGGAACTC AGTACGGATC GGTGTTTGCC CGATCAGGCT
ATTGTGATGG TTAACACCAA TCAATCAGGT GAATTGGCCT TATTCCGCGC TGAACTCGCA
ACTGAAACGG TCTTAGACCA CCCGCACCTT GTGCTGTGTG GCGCACCGGG GTGTGGGAAA
TCGACCTTTG CGCATCATTT GGTGTGGGTC TTGGCCCAAC GTGGGTTGGA TCAGATTAAC
CACCATACGG GCTTGCTCGG TTGGAATGAC ACACAGCGCC TGTTGCCGAT TGTGATGCCG
CTGCGGCGTT TAGCAGGAGC CTTGGTGGGC ACCGATGTGG GGTTGACCGA TGCCATGCCA
AATGTTGGGT TGCTGCGTGA TGCGGTGTGT GCCCATATGC AAACGAAATA TGGCATTGAG
AAACCACACA CGCTGCTTGA TGCCGGATTA GCACGTTCGC TAAAAGTGCT GTTGGTGTTC
GATGGCCTGG ATGAAGTGCC ACTCGAAGCC TCCTCCACGA GCCTTGATCA GAGGACGGTG
CTGCGGTTTA TCCGTCGGTG TGCCGGGTTG AACGTCCGCA TCCTGATTAC GTGTCGCTCA
CGGGCATGGA CGGATGAGTA TCGCCAGATC ACGCAGTGGC CGATGGTTAA ATTGGCTCCA
TTGACGGGCG GCCAGATGAC CCAGTTTATT CACACGTGGT TTCCGCAGTT GCGTACCAAG
GGGGTGATTG CATACGAGGC GATTGCGCGA TTGAGCCAGC GCTTAGTGCA GGCATTGCGT
GACCCCCACC GTGAAAAATT ACGCAAAATG GCCGAAAATC CCTTATTGCT GAGCATGATC
ATCTTTGTAA TGGCGGATAC CGGCAATTTG CCCCACGACC GCGCCAAGCT CTACGAGCAG
ATCTTAGAGC AATTGCTGGA ACAGTGGGAT GCCAAACGGA ATGGGCACGA CCTGGCGCAG
GCGATTGGTG ATGAACGCAT TACGGGGAAA AAACTCCGTG ATTTCGTGCT GGATCGGCTG
TGCTATCAGG CCCATCTCAC AACGACATCG AACGATGGTC GTGGGCAAAT TGACGCAATG
CAGCTCAAAA AGGCATTAAT GGACTATTTT GCCAAAATCA AGATCAAAAC GAACGATCCC
TATTGGGCGG CTGAACGCTG TGTTGCCTAT ATTGATCAGC GGAGTGGCTT GCTTCAACCT
GACGATACAG GCAATGTCTA CACGTTTGCC CACCTGACGT TGCAAGAACA TTGCGCTGGC
CGTCATTTGT TGTTTGAGGA GCCGCTCCAG CAGATGTTAG CCGTCCGCCG TGATGATCGC
TGGCGTGAGC CGATCTTTTT GGGCGTTGGT TGCCTAGCGG ACGATAAACG GGCATCGAGC
AAAATTGGCG AGGTATTATC AGCGTTGATC AACCCTGATG AATTTCGGAG TAAACCGCGC
AAACCGAAAC ATCGTTATGA ATGGTATCGC GATCTGGTGC TTGCTGCTGC GATTGGAGCC
GATTGCGATT GGGATACGCT GAATGGGACT GATCTGGATG TTGAGTATTT TCACGGTGCG
TTGCGGTACG GCATTGTCAT CCTGCTTGAA GACCGTGCGC ATGCCCAAGC AGCACTGGAT
TACTACCACG GTCAACTCAT GGAGCCAGCG CCATTATTGG TGCGCGAACG GCAAAAAGCT
GCTGAGCTAT TAGCAGGTCT GGGCGACTCG CGTTATCCGG TGAATAGTGA CCAATGGCAA
CAGGAGACGC GTCAGCTTTC CACCCAGTTT AGTCGCGAGG GCACCCACTA TTGGCGCTTT
GTGCCTGCGG GCCACTATCA GGTTGGCGGT TGGTATTACG ATGAACAACC TCCAACCGTC
GTACTTCAAC CCTACTGGGT CGGGCGGTTT ATGATTACGG TGGAGCAATA TCAGGCATTT
ATTGAGGCAG GTGGCTATAC CAACGAGGAT TACTGGACGA AGCATGGTCG TGCCTATAAA
AAGCGTTCTA ACAAAATAGT GCCTCGCTGG TGGGATGATC AAACCGAGCA GGAATACCGC
AATCAGCCTA TTTATGGGGT GAGTTGGTAT GAGGCGGTGG CCTATTGCCA GTGGTTGACC
CAGCAGCTTA GCCCATTCTT ACCGCAGGGG TATGGTATTC GCCTTGCCAG CGAAGCTGAA
TGGGAGGTGG CAGTGGCTTA TACCACCGAT GGACAGCGCC AACCCTCTCC GTGGGGTGCA
CAACCTGTTA CGCCGGAACA TGCGATCTAT GATTGGAGTA CGAAAAACCG CCCCTTATCG
GTTGGTGTAG GATTGCTGGG GCAAGCGGCC TGTGGTGCAC TGGATAGCGT TGGCAACCTG
TGGGAATGGA CAGCTACGCC CTATCAGCAA AACCATGGTG CGGTGCAGCT GACGCTTGCA
GATAGTGACG ATGATATGGC GGTGCGAGGT GGCGCATATT ATAGTAGGAG TACAGATATT
CGTTGCACGG CGCGGCACAG GCTTCGTCCC GACTTCGACG ACTATCACCG AGGATTTCGT
TGTATGCTCG CCCCTGTTGT GAATGCTGAG TCCTGA
 
Protein sequence
MADSTDGSAY VHDSHINGVV VGTNLGTILY GRPPEEAERD LLVRYLQRVT ELHSTMPVVG 
LGSSRLDAGL DLASVYMMLA VQGRYRAVRT LTAEETEAYR QQRFVIPKEL STDRCLPDQA
IVMVNTNQSG ELALFRAELA TETVLDHPHL VLCGAPGCGK STFAHHLVWV LAQRGLDQIN
HHTGLLGWND TQRLLPIVMP LRRLAGALVG TDVGLTDAMP NVGLLRDAVC AHMQTKYGIE
KPHTLLDAGL ARSLKVLLVF DGLDEVPLEA SSTSLDQRTV LRFIRRCAGL NVRILITCRS
RAWTDEYRQI TQWPMVKLAP LTGGQMTQFI HTWFPQLRTK GVIAYEAIAR LSQRLVQALR
DPHREKLRKM AENPLLLSMI IFVMADTGNL PHDRAKLYEQ ILEQLLEQWD AKRNGHDLAQ
AIGDERITGK KLRDFVLDRL CYQAHLTTTS NDGRGQIDAM QLKKALMDYF AKIKIKTNDP
YWAAERCVAY IDQRSGLLQP DDTGNVYTFA HLTLQEHCAG RHLLFEEPLQ QMLAVRRDDR
WREPIFLGVG CLADDKRASS KIGEVLSALI NPDEFRSKPR KPKHRYEWYR DLVLAAAIGA
DCDWDTLNGT DLDVEYFHGA LRYGIVILLE DRAHAQAALD YYHGQLMEPA PLLVRERQKA
AELLAGLGDS RYPVNSDQWQ QETRQLSTQF SREGTHYWRF VPAGHYQVGG WYYDEQPPTV
VLQPYWVGRF MITVEQYQAF IEAGGYTNED YWTKHGRAYK KRSNKIVPRW WDDQTEQEYR
NQPIYGVSWY EAVAYCQWLT QQLSPFLPQG YGIRLASEAE WEVAVAYTTD GQRQPSPWGA
QPVTPEHAIY DWSTKNRPLS VGVGLLGQAA CGALDSVGNL WEWTATPYQQ NHGAVQLTLA
DSDDDMAVRG GAYYSRSTDI RCTARHRLRP DFDDYHRGFR CMLAPVVNAE S