Gene Haur_4025 details


Gene Information

Locus tag: Haur_4025
Symbol:
ID: 5735886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5137458
End bp: 5139317
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 47%
IMG OID: 641281175
Product: hypothetical protein
Protein accession: YP_001546785
Protein GI: 159900538
COG category: [S] Function unknown
COG ID: [COG1479] Uncharacterized conserved protein
TIGRFAM ID:
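
The start, end, and length fields above can be checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the protein length excludes the terminal stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5137458, 5139317)
print(length, protein_length(length))  # prints: 1860 619
```

Under those assumptions the computed values match the Gene Length and Protein Length fields listed in the record.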


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000214735
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA AAACATTAAC GAGTTTGTTT GCCGAGTCAA TCTACCAAAT TCCAGATTAT 
CAGCGTGGTT ATGCTTGGGA AGAAAAACAG TGGAAAGATT TTATACAAGA TATTGATGCC
CTGGTCGATG AGCAGGTGAC CAGTCATTAT ACTGGAACCG TTGTAGTTTA CGAAGGCCGC
GACGCTGAAA AGCGACCATA TGGCCGAAAG AAGCTAAAAG TGCTTGATGT GGTTGATGGA
CAGCAGCGAT TAACCACCAC TTGCCTCTAT CTTTCAGTAA TTATTCGCGC ACTGATTCAG
CATGGAGAGT CGGACTATGA GCGCGATATT GATGATTTTC TGTATGCAGG CGCAACCTGC
AAGCTGAACC TCAATAATGA AACTGGCGAC ATCTTTTATG ATCTTTTAAA GACAGGCTAT
GTCAATACCC CGCTTCAGTC GCCACATCAG CATCGGCTCG TCGAGGCCCA CCGTCGCTTT
CAACACCACA TTAGCGAGCA GTTGCAGCAA CGAGGTGGCG CTGGGGTCGC TTATCTCAAA
GAATTGCATT ATGCGATTAC CCAAAAACTC AATTTTACCT TCTATGTGAT CGAATCAGAA
GCCGAAATTG GCATGACCTT CGAGTTGATG AACTCGCGGG GCAAGGATCT TTCGGTGCTT
GAACTGCTCA AAAATTATTT AATGCACTGG GTTTCGCGCA ACGAAAACAA CCTTGCAGAT
CGTGAAACCC TCACCAAACT GATCAATCGC AGTTGGAAAG ACACCTACAC CAACCTTGGC
GCAAGTTCGG GCAACAATGA AGATCAATGT CTACGCATCG CTTGGACGCT CTATTGCAGC
CATTCCCCAG CCAATTGGCA TGGGTATGAA GGCTTCAAAG CTGATGAATA CATCCCGCTG
AGAACATTTA GTAAACGCAC GAAGGCCGAG ACAAAAATAT TTATCGAGCA CTTTGTGATG
GGCCTTGCTG AAGTTTCACA TCATTATGCC AGCATTATCA ATCCAACCAC CACCACAGCG
CTATTCGAAG CTGAGCGGAT TTGGCTCAGC AAGATTCGGC ATACCGGCAA CATTGCCAAT
TTCTTGCCCT TGATGGTGGC AGCCCGCAAG CAATACCAAG CAGGGCAGAT TAGCGAAGGC
GCATACATCG ACATGCTCAA GGCACTCGAA TGCTATGCCT ATCGTGTATT TCTTTGGGCA
GCCCGCCGCA GCAATGCTGG TAAATCAAGC TTCTATCGTT GGGGATACGA GATCTTTACT
CAGCCGCAAC TGATCAGTGA TATTACGCGC GGGATTCATC AACTGACCCG CTACTATGCA
CCTGAAGATG ATTTTATCAA CGGCAATGCC AACCCCAGCG ATTGGTATAG AACTCGGAAT
CGCTTGAGGT ACACCCTGTT TGAGTATGAG TTACATCTGC TTGCGACCGA GGGGAAAAAT
AGCGAACCAC GACTTGGCTG GGATCAGCTC AGCGATTCGA CGATTGAGCA TATTCTGCCG
CAGAATCCAG CAAAACATTC GCATTGGAAT GGCGTATGGA ACAAAACCGC GTTCAATGCA
AGTGTCCACG ATATCGCCAA TCTTGTGCTT ACCCACAATA ATGCCAGCTA TAGCAACTTT
GAGTTTGCCC GCAAAAAGGG CCAACCAGGC CTAAGTCCTA GTTATAGCGA TTCTGATATT
CGCCAAGAAC GTAAACTCGC GGCCTTTGCC GATTGGACTC CCAAAGAGTT TGCTGAACGC
CGAAACGAGT TGATCATATG GATCAATCAG CGCTGGAAAA CCGTCGGCGA ACCCGACAAT
GCAACGTTGG AAGTCAACGA CGAGGCTGAT GACGATGGCA TCGAGCATCA AGAAGGATAA
 
Protein sequence
MNKKTLTSLF AESIYQIPDY QRGYAWEEKQ WKDFIQDIDA LVDEQVTSHY TGTVVVYEGR 
DAEKRPYGRK KLKVLDVVDG QQRLTTTCLY LSVIIRALIQ HGESDYERDI DDFLYAGATC
KLNLNNETGD IFYDLLKTGY VNTPLQSPHQ HRLVEAHRRF QHHISEQLQQ RGGAGVAYLK
ELHYAITQKL NFTFYVIESE AEIGMTFELM NSRGKDLSVL ELLKNYLMHW VSRNENNLAD
RETLTKLINR SWKDTYTNLG ASSGNNEDQC LRIAWTLYCS HSPANWHGYE GFKADEYIPL
RTFSKRTKAE TKIFIEHFVM GLAEVSHHYA SIINPTTTTA LFEAERIWLS KIRHTGNIAN
FLPLMVAARK QYQAGQISEG AYIDMLKALE CYAYRVFLWA ARRSNAGKSS FYRWGYEIFT
QPQLISDITR GIHQLTRYYA PEDDFINGNA NPSDWYRTRN RLRYTLFEYE LHLLATEGKN
SEPRLGWDQL SDSTIEHILP QNPAKHSHWN GVWNKTAFNA SVHDIANLVL THNNASYSNF
EFARKKGQPG LSPSYSDSDI RQERKLAAFA DWTPKEFAER RNELIIWINQ RWKTVGEPDN
ATLEVNDEAD DDGIEHQEG
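
The sequences above are displayed in blocks of 10 characters, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a simple sanity check for a complete coding sequence. The function names are hypothetical, and the start-codon check is a simplification: it accepts only ATG, whereas translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3, starts with ATG, and ends with a stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First line of the gene sequence, pasted as displayed in the record.
first_line = "ATGAACAAAA AAACATTAAC GAGTTTGTTT GCCGAGTCAA TCTACCAAAT TCCAGATTAT "
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.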