Gene Haur_4856 details


Gene Information

Locus tag: Haur_4856
Symbol:
ID: 5736702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6187955
End bp: 6191902
Gene Length: 3948 bp
Protein Length: 1315 aa
Translation table: 11
GC content: 51%
IMG OID: 641282022
Product: hypothetical protein
Protein accession: YP_001547614
Protein GI: 159901367
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
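
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end must not precede start")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length(6187955, 6191902)
print(length, protein_length(length))  # prints "3948 1315"
```

Both results agree with the Gene Length and Protein Length fields in this record.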


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.212334
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCATG CATTTTCACT GCGTTTTACT CGTTTGCTTT CATTAGGAAT GGTCGGTTTG 
TTGATCGTGA CGGGGGCGCT GCTCAATCGC CCAGTCGCTG CCCAACGGGT AGTCCCACCA
ACTTCCAAGC AGCTCGATCA ATCGGTCATC CAACAGGCTG AGGCTTTGAA TCAAGCTGCT
GCTCAGCGCG AAGTGCCGCT TACCATTCAA CGCTTGCAAG CCTTGAATTC ACCGCCAGCC
GCGCCAAAAC CGGTAGCTCA GCCAAGGCCA TTTGTTCCGA CAGGGATTTT TGTGGTCAAT
ACGGTTGCTG ATACCGACGA TGGCGCGTGC GATGCTCTTG GCACGGGCAC TGGCAACCAA
GATTGTAGCT TGCGCGAAGC GATTAACGCG GCGAATGCCA ATGTTGATGC CGACACGATT
AATTTTGCGA TTCCAGGGGC TGGGGTCAAA ACCATCGTTC TCACCAGCGA ATTGCCAGCC
TTGCAAACGC AAATTCAAAT CGACGGCTGT AGCCAAGCAG GCAGCGATTG CAGCACATGG
CCAATGAGCT TGGTGGTCGA GCTTGATGGT TCAGGTTTGA GCGACCCTAG CGGATTTACT
GATACTGAAA TTTTGCTGGC CGAAGCTGAT GGCTCGGTAA TTCGTGGCTT GGTTTTGAAC
GCTGCTCGTA GCCCAAGCTT TTATACTTGG ATTGGAATTA ACGTCAAAGG TGACAACGTT
TGGCTAACCC AAAATATTAT CGGCCTCGAA CCCGACGGCC AAACCGCCAA CGGCAATAAT
ATGGGCATTT TGCTCTATCC AGACAGTAAT CAAGCGATGA TTGGCACCAA TGGCGATGGC
AGTAATGATG CGCTCGAAGG TAATATTATT GCTGGCCAAA CCTTCAATGG TATTTCATTC
TCAGGTGGCG CGAATAGCCG CATCGCTGGT AACTATATTG GGGTGGCCAG CGACGGCAAT
ACTGCCCGCA GTAATCGCGC TGGCATCGAA TTTTTTGGGC CGACCATTGG CAATGTCACA
ATTGGCACCA ACAGCGATGG CATTGGCGAT GCCGCCGAAC GCAATATTAT CAGCGGTAAC
ATCTTAGGCA TCTATATTTA TGGCTCATCA AATAATCACA TTCGTGGCAA CTATATTGGA
TTAAATGCAG CTGGAACTGC CGCAGCAGCC AACGATACAG GTATTCAGAT TAGTGGCAAT
ACGATTGAGA CGATCATTGG AACCAATGGT GATGGCGTGC GCGATGCAGT CGAAGGTAAC
GTGATCAGTG GCAATAGCGC CGAGGGCGTG CAAATTGCCG AATTTAGCGG TGGCTCCAGC
GGTTTCCCAA CCGATAACGT CATTGCAGGC AATATTATCG GGCTTGATCC AACTGGCTCG
ACCGCCATTG GCAATCAGCG TGGGGTCGTG CTGCGTTTTG GGCCAAGTGG CACGCGCATC
GGCACCAATG GCGATGGCAT GAGCGATGCT TTAGAACGCA ACATTATTAG CGGCAACAGC
GATCTTGGGA TTGGCGTTGG TGGTGGCGAT CAGCCAATCA CCGATACGAT CATCAGTGGC
AACTACATTG GTACTGACAG CACTGGTTTA GTAGCTCGCC CAAACAGAGG TGGCGTGCAA
ATTCAAAATG AAGTTGCTGG CTTAGTGCTG GGCACTGATG GCGACGAAAC TGCCGACGAT
GCCGAGGGCA ACCTGATTAG TGGCAATAAC GGCAATGGGG TAAACTTCGT TTTCGACCCT
ACCAATGTTA CCATGCAAGG CAATCGCATC GGCGTAGCGC TCGATAACAG CCCGCTTGGC
AACCAAGGCG ATGGCATTGC GCTGCTCGAA TTGAGCGAAT TGGCTCCGAG CAACATCAAA
ATCGGCGGCG AAGGCTTATT TAGCGAAAAT AGCATTGCGT TCAATGCTGA GCTAGGCATT
GATATTAACA ATGATGGCCC CACCGCCAAC GATCTTGATG ATCTTGATGC AGGCCCGAAT
AGCACCCAAA ATTATCCAGT AGTCGTCAGT GTTATTGATG ATGGCACAAC GGTTACAATC
CAAGGGACAT TCAACAGCAC TGCCAGCCAA GAGCTACGTT TGGCTTTTTA CAGCAATACC
ACCTGCGATA CTTCGGGCTA TGGCGAGGGC CAACAATTCT TAGAAAGTGA TTTTTTCACG
ACCGATGCCG CTGGAAATGG CGCATTTACC AAGCTGTTTC CCAGCCCAGC CCCAAATATC
ACGATTTTAG CAACGAACGT CAATACCAAC GAAACCTCGG AGTTTTCGCA ATGTTGGGTT
GCGCCCCCAG CGACAGCCAC CCCAACCGCA ACCAACACAG CGACCAACAC GGCGACGAAC
ACGCCGACGA ACACACCAAC CAACACGGCG ACAAATACCC CAAGCAACAC GCCAACCAAT
ACGGCGACGA ACACGCCAAC GTTAACCCCA AGCAACACGC CAACCAATAC GGCGACGAAT
ACGCCAAGTA ACACGCCAAC GCCCAGCAAT ACCGCTTTGC CAGTTACCCC AACAATTCCG
CCAAGTGCAT GTGTGCCAGG CAGCAATCTG CAAGTGTTGT TCAGCGATAG CGTGAGCAGC
GAACCACTGT GGAATGTCAG TGGCACAGGC CCAACCTGGG TCTTGGATAA TAGCAGTTTT
AATAGCCCCA GCCTCTCGTG GCATGCCGAT AATCCTGAGA CGATTAGCGA TCAACGGCTA
ACGACGATGG CCGGCATCAC AATTCCAGCG GGCGCGAGTG AAGTAACGTT GTATTTCAAC
CATAACTATA GCTTTGAGTT TGATAACGAA GGTGGTGGCG GCGAACCAAC CCCAATGCCC
GAACGCTCAG CCAAACGCAA CAATACACCA TTTGATATTT ATTTTGATGG TGGGGTGGTT
GAATATAGCA CTGATGGCAC AACCTTTAAC GATCTTGGAA GTTTCTTTAC CAGCTCGGGC
TACGATGGCG TACTGACACT TGAGAGCGAT AATCCCTTAG CGGGACAATC GGCATTCGTT
GGTTTGAGCG GCGATTATCC ATTTTATATT GAAGAAACTG CTGATTTAAC CGCCTTGGCA
GGCCAAACAA TTTGGTTGCG CTTCCGCATG GGCAGCGACA GCTTGGTTAG CGCTCTTGGC
TGGAATGTTG ATGATATTGT GGTGGCTGGC TGTGTGCCAA GCGCCGCCAC GAACACGCCA
ACGGCAACCG CTACGCCAAC GGCAACCGCT ACGCCAACTG CCACGAACAC GGCGACAGCA
ACACCAAGTA ATACGCCAAC CAATACGCCA ACACCAAGTA ATACGCCAAC CAGTACCGCG
ACAGCGACGG TCACTAATAC ACCAACTAAT ACAGCAACGC CAACGGTCAG CAACACACCA
ACTGCTACTG GCACGCCATT TATTGGCACA AGCCAAAACT TCTTGCCGCT GGTGTTAGGC
TATTGCTATG CCGATTTACA GGTGAGCAGT ATCACGGTTG ACCAGAAATT AGTTGTGACA
GTCACCAATC AAGGTAATTG TGCGACGAGT GAAGCATTTT GGGTCGATTT GTATATCGCG
CCCAATCCAG CGCCTAGCCA TGTCAACCAA CAATGGTGGG ATGTTGCCAG TCAAGGCATT
GTTTGGGGTG TGACCGAGGC ACTTGCGCCA GGTCAATCGA TCACGCTTCG GCCCTATGAT
CAATATTACT CAGCACGGCG TAGCGAGTGG GCTAACCGCA TTGCCACCCA AACCAAGCTG
TATGTTCAGG CTGATGCCTA CAATGCTGCA ACTAGCTATG GCGCAGTTTT GGAGTTGCAT
GAAGCCTTCA ATTTTGAATA CAACAACATT ACGTCAATTA TCACCAGCGC TCATTTCAGC
CAACCAACTG GGTTACGCCA GCAATTAGCA ACCGAAACAG GCTTGCCGCT ACGGGTTATG
CCAAGCAATC CAAGCCCAAA TCAATTTCGG CCAATTCCCG TTCGCTAA
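
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and how the database rounds its reported "GC content" value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record: six blocks of ten bases.
first_line = "ATGAAGCATG CATTTTCACT GCGTTTTACT CGTTTGCTTT CATTAGGAAT GGTCGGTTTG"
print(len(clean_sequence(first_line)))  # prints 60
```

Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field in this record, subject to that rounding.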
 
Protein sequence
MKHAFSLRFT RLLSLGMVGL LIVTGALLNR PVAAQRVVPP TSKQLDQSVI QQAEALNQAA 
AQREVPLTIQ RLQALNSPPA APKPVAQPRP FVPTGIFVVN TVADTDDGAC DALGTGTGNQ
DCSLREAINA ANANVDADTI NFAIPGAGVK TIVLTSELPA LQTQIQIDGC SQAGSDCSTW
PMSLVVELDG SGLSDPSGFT DTEILLAEAD GSVIRGLVLN AARSPSFYTW IGINVKGDNV
WLTQNIIGLE PDGQTANGNN MGILLYPDSN QAMIGTNGDG SNDALEGNII AGQTFNGISF
SGGANSRIAG NYIGVASDGN TARSNRAGIE FFGPTIGNVT IGTNSDGIGD AAERNIISGN
ILGIYIYGSS NNHIRGNYIG LNAAGTAAAA NDTGIQISGN TIETIIGTNG DGVRDAVEGN
VISGNSAEGV QIAEFSGGSS GFPTDNVIAG NIIGLDPTGS TAIGNQRGVV LRFGPSGTRI
GTNGDGMSDA LERNIISGNS DLGIGVGGGD QPITDTIISG NYIGTDSTGL VARPNRGGVQ
IQNEVAGLVL GTDGDETADD AEGNLISGNN GNGVNFVFDP TNVTMQGNRI GVALDNSPLG
NQGDGIALLE LSELAPSNIK IGGEGLFSEN SIAFNAELGI DINNDGPTAN DLDDLDAGPN
STQNYPVVVS VIDDGTTVTI QGTFNSTASQ ELRLAFYSNT TCDTSGYGEG QQFLESDFFT
TDAAGNGAFT KLFPSPAPNI TILATNVNTN ETSEFSQCWV APPATATPTA TNTATNTATN
TPTNTPTNTA TNTPSNTPTN TATNTPTLTP SNTPTNTATN TPSNTPTPSN TALPVTPTIP
PSACVPGSNL QVLFSDSVSS EPLWNVSGTG PTWVLDNSSF NSPSLSWHAD NPETISDQRL
TTMAGITIPA GASEVTLYFN HNYSFEFDNE GGGGEPTPMP ERSAKRNNTP FDIYFDGGVV
EYSTDGTTFN DLGSFFTSSG YDGVLTLESD NPLAGQSAFV GLSGDYPFYI EETADLTALA
GQTIWLRFRM GSDSLVSALG WNVDDIVVAG CVPSAATNTP TATATPTATA TPTATNTATA
TPSNTPTNTP TPSNTPTSTA TATVTNTPTN TATPTVSNTP TATGTPFIGT SQNFLPLVLG
YCYADLQVSS ITVDQKLVVT VTNQGNCATS EAFWVDLYIA PNPAPSHVNQ QWWDVASQGI
VWGVTEALAP GQSITLRPYD QYYSARRSEW ANRIATQTKL YVQADAYNAA TSYGAVLELH
EAFNFEYNNI TSIITSAHFS QPTGLRQQLA TETGLPLRVM PSNPSPNQFR PIPVR
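
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch with a hypothetical `translate` helper. It uses the standard amino acid assignments, which translation table 11 shares with table 1. Table 11 also permits alternative start codons, which this sketch ignores because the gene here begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and final six codons of the gene sequence in this record.
print(translate("ATGAAGCATG CATTTTCACT G"))  # prints "MKHAFSL"
print(translate("CCAATTCCCG TTCGCTAA"))      # prints "PIPVR"
```

The two fragments reproduce the first seven and last five residues of the protein sequence listed above.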