Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4856 |
Symbol | |
ID | 5736702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6187955 |
End bp | 6191902 |
Gene Length | 3948 bp |
Protein Length | 1315 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641282022 |
Product | hypothetical protein |
Protein accession | YP_001547614 |
Protein GI | 159901367 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.212334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCATG CATTTTCACT GCGTTTTACT CGTTTGCTTT CATTAGGAAT GGTCGGTTTG TTGATCGTGA CGGGGGCGCT GCTCAATCGC CCAGTCGCTG CCCAACGGGT AGTCCCACCA ACTTCCAAGC AGCTCGATCA ATCGGTCATC CAACAGGCTG AGGCTTTGAA TCAAGCTGCT GCTCAGCGCG AAGTGCCGCT TACCATTCAA CGCTTGCAAG CCTTGAATTC ACCGCCAGCC GCGCCAAAAC CGGTAGCTCA GCCAAGGCCA TTTGTTCCGA CAGGGATTTT TGTGGTCAAT ACGGTTGCTG ATACCGACGA TGGCGCGTGC GATGCTCTTG GCACGGGCAC TGGCAACCAA GATTGTAGCT TGCGCGAAGC GATTAACGCG GCGAATGCCA ATGTTGATGC CGACACGATT AATTTTGCGA TTCCAGGGGC TGGGGTCAAA ACCATCGTTC TCACCAGCGA ATTGCCAGCC TTGCAAACGC AAATTCAAAT CGACGGCTGT AGCCAAGCAG GCAGCGATTG CAGCACATGG CCAATGAGCT TGGTGGTCGA GCTTGATGGT TCAGGTTTGA GCGACCCTAG CGGATTTACT GATACTGAAA TTTTGCTGGC CGAAGCTGAT GGCTCGGTAA TTCGTGGCTT GGTTTTGAAC GCTGCTCGTA GCCCAAGCTT TTATACTTGG ATTGGAATTA ACGTCAAAGG TGACAACGTT TGGCTAACCC AAAATATTAT CGGCCTCGAA CCCGACGGCC AAACCGCCAA CGGCAATAAT ATGGGCATTT TGCTCTATCC AGACAGTAAT CAAGCGATGA TTGGCACCAA TGGCGATGGC AGTAATGATG CGCTCGAAGG TAATATTATT GCTGGCCAAA CCTTCAATGG TATTTCATTC TCAGGTGGCG CGAATAGCCG CATCGCTGGT AACTATATTG GGGTGGCCAG CGACGGCAAT ACTGCCCGCA GTAATCGCGC TGGCATCGAA TTTTTTGGGC CGACCATTGG CAATGTCACA ATTGGCACCA ACAGCGATGG CATTGGCGAT GCCGCCGAAC GCAATATTAT CAGCGGTAAC ATCTTAGGCA TCTATATTTA TGGCTCATCA AATAATCACA TTCGTGGCAA CTATATTGGA TTAAATGCAG CTGGAACTGC CGCAGCAGCC AACGATACAG GTATTCAGAT TAGTGGCAAT ACGATTGAGA CGATCATTGG AACCAATGGT GATGGCGTGC GCGATGCAGT CGAAGGTAAC GTGATCAGTG GCAATAGCGC CGAGGGCGTG CAAATTGCCG AATTTAGCGG TGGCTCCAGC GGTTTCCCAA CCGATAACGT CATTGCAGGC AATATTATCG GGCTTGATCC AACTGGCTCG ACCGCCATTG GCAATCAGCG TGGGGTCGTG CTGCGTTTTG GGCCAAGTGG CACGCGCATC GGCACCAATG GCGATGGCAT GAGCGATGCT TTAGAACGCA ACATTATTAG CGGCAACAGC GATCTTGGGA TTGGCGTTGG TGGTGGCGAT CAGCCAATCA CCGATACGAT CATCAGTGGC AACTACATTG GTACTGACAG CACTGGTTTA GTAGCTCGCC CAAACAGAGG TGGCGTGCAA ATTCAAAATG AAGTTGCTGG CTTAGTGCTG GGCACTGATG GCGACGAAAC TGCCGACGAT GCCGAGGGCA ACCTGATTAG TGGCAATAAC GGCAATGGGG TAAACTTCGT TTTCGACCCT ACCAATGTTA CCATGCAAGG CAATCGCATC GGCGTAGCGC TCGATAACAG CCCGCTTGGC AACCAAGGCG ATGGCATTGC GCTGCTCGAA TTGAGCGAAT TGGCTCCGAG CAACATCAAA ATCGGCGGCG AAGGCTTATT TAGCGAAAAT AGCATTGCGT TCAATGCTGA GCTAGGCATT GATATTAACA ATGATGGCCC CACCGCCAAC GATCTTGATG ATCTTGATGC AGGCCCGAAT AGCACCCAAA ATTATCCAGT AGTCGTCAGT GTTATTGATG ATGGCACAAC GGTTACAATC CAAGGGACAT TCAACAGCAC TGCCAGCCAA GAGCTACGTT TGGCTTTTTA CAGCAATACC ACCTGCGATA CTTCGGGCTA TGGCGAGGGC CAACAATTCT TAGAAAGTGA TTTTTTCACG ACCGATGCCG CTGGAAATGG CGCATTTACC AAGCTGTTTC CCAGCCCAGC CCCAAATATC ACGATTTTAG CAACGAACGT CAATACCAAC GAAACCTCGG AGTTTTCGCA ATGTTGGGTT GCGCCCCCAG CGACAGCCAC CCCAACCGCA ACCAACACAG CGACCAACAC GGCGACGAAC ACGCCGACGA ACACACCAAC CAACACGGCG ACAAATACCC CAAGCAACAC GCCAACCAAT ACGGCGACGA ACACGCCAAC GTTAACCCCA AGCAACACGC CAACCAATAC GGCGACGAAT ACGCCAAGTA ACACGCCAAC GCCCAGCAAT ACCGCTTTGC CAGTTACCCC AACAATTCCG CCAAGTGCAT GTGTGCCAGG CAGCAATCTG CAAGTGTTGT TCAGCGATAG CGTGAGCAGC GAACCACTGT GGAATGTCAG TGGCACAGGC CCAACCTGGG TCTTGGATAA TAGCAGTTTT AATAGCCCCA GCCTCTCGTG GCATGCCGAT AATCCTGAGA CGATTAGCGA TCAACGGCTA ACGACGATGG CCGGCATCAC AATTCCAGCG GGCGCGAGTG AAGTAACGTT GTATTTCAAC CATAACTATA GCTTTGAGTT TGATAACGAA GGTGGTGGCG GCGAACCAAC CCCAATGCCC GAACGCTCAG CCAAACGCAA CAATACACCA TTTGATATTT ATTTTGATGG TGGGGTGGTT GAATATAGCA CTGATGGCAC AACCTTTAAC GATCTTGGAA GTTTCTTTAC CAGCTCGGGC TACGATGGCG TACTGACACT TGAGAGCGAT AATCCCTTAG CGGGACAATC GGCATTCGTT GGTTTGAGCG GCGATTATCC ATTTTATATT GAAGAAACTG CTGATTTAAC CGCCTTGGCA GGCCAAACAA TTTGGTTGCG CTTCCGCATG GGCAGCGACA GCTTGGTTAG CGCTCTTGGC TGGAATGTTG ATGATATTGT GGTGGCTGGC TGTGTGCCAA GCGCCGCCAC GAACACGCCA ACGGCAACCG CTACGCCAAC GGCAACCGCT ACGCCAACTG CCACGAACAC GGCGACAGCA ACACCAAGTA ATACGCCAAC CAATACGCCA ACACCAAGTA ATACGCCAAC CAGTACCGCG ACAGCGACGG TCACTAATAC ACCAACTAAT ACAGCAACGC CAACGGTCAG CAACACACCA ACTGCTACTG GCACGCCATT TATTGGCACA AGCCAAAACT TCTTGCCGCT GGTGTTAGGC TATTGCTATG CCGATTTACA GGTGAGCAGT ATCACGGTTG ACCAGAAATT AGTTGTGACA GTCACCAATC AAGGTAATTG TGCGACGAGT GAAGCATTTT GGGTCGATTT GTATATCGCG CCCAATCCAG CGCCTAGCCA TGTCAACCAA CAATGGTGGG ATGTTGCCAG TCAAGGCATT GTTTGGGGTG TGACCGAGGC ACTTGCGCCA GGTCAATCGA TCACGCTTCG GCCCTATGAT CAATATTACT CAGCACGGCG TAGCGAGTGG GCTAACCGCA TTGCCACCCA AACCAAGCTG TATGTTCAGG CTGATGCCTA CAATGCTGCA ACTAGCTATG GCGCAGTTTT GGAGTTGCAT GAAGCCTTCA ATTTTGAATA CAACAACATT ACGTCAATTA TCACCAGCGC TCATTTCAGC CAACCAACTG GGTTACGCCA GCAATTAGCA ACCGAAACAG GCTTGCCGCT ACGGGTTATG CCAAGCAATC CAAGCCCAAA TCAATTTCGG CCAATTCCCG TTCGCTAA
|
Protein sequence | MKHAFSLRFT RLLSLGMVGL LIVTGALLNR PVAAQRVVPP TSKQLDQSVI QQAEALNQAA AQREVPLTIQ RLQALNSPPA APKPVAQPRP FVPTGIFVVN TVADTDDGAC DALGTGTGNQ DCSLREAINA ANANVDADTI NFAIPGAGVK TIVLTSELPA LQTQIQIDGC SQAGSDCSTW PMSLVVELDG SGLSDPSGFT DTEILLAEAD GSVIRGLVLN AARSPSFYTW IGINVKGDNV WLTQNIIGLE PDGQTANGNN MGILLYPDSN QAMIGTNGDG SNDALEGNII AGQTFNGISF SGGANSRIAG NYIGVASDGN TARSNRAGIE FFGPTIGNVT IGTNSDGIGD AAERNIISGN ILGIYIYGSS NNHIRGNYIG LNAAGTAAAA NDTGIQISGN TIETIIGTNG DGVRDAVEGN VISGNSAEGV QIAEFSGGSS GFPTDNVIAG NIIGLDPTGS TAIGNQRGVV LRFGPSGTRI GTNGDGMSDA LERNIISGNS DLGIGVGGGD QPITDTIISG NYIGTDSTGL VARPNRGGVQ IQNEVAGLVL GTDGDETADD AEGNLISGNN GNGVNFVFDP TNVTMQGNRI GVALDNSPLG NQGDGIALLE LSELAPSNIK IGGEGLFSEN SIAFNAELGI DINNDGPTAN DLDDLDAGPN STQNYPVVVS VIDDGTTVTI QGTFNSTASQ ELRLAFYSNT TCDTSGYGEG QQFLESDFFT TDAAGNGAFT KLFPSPAPNI TILATNVNTN ETSEFSQCWV APPATATPTA TNTATNTATN TPTNTPTNTA TNTPSNTPTN TATNTPTLTP SNTPTNTATN TPSNTPTPSN TALPVTPTIP PSACVPGSNL QVLFSDSVSS EPLWNVSGTG PTWVLDNSSF NSPSLSWHAD NPETISDQRL TTMAGITIPA GASEVTLYFN HNYSFEFDNE GGGGEPTPMP ERSAKRNNTP FDIYFDGGVV EYSTDGTTFN DLGSFFTSSG YDGVLTLESD NPLAGQSAFV GLSGDYPFYI EETADLTALA GQTIWLRFRM GSDSLVSALG WNVDDIVVAG CVPSAATNTP TATATPTATA TPTATNTATA TPSNTPTNTP TPSNTPTSTA TATVTNTPTN TATPTVSNTP TATGTPFIGT SQNFLPLVLG YCYADLQVSS ITVDQKLVVT VTNQGNCATS EAFWVDLYIA PNPAPSHVNQ QWWDVASQGI VWGVTEALAP GQSITLRPYD QYYSARRSEW ANRIATQTKL YVQADAYNAA TSYGAVLELH EAFNFEYNNI TSIITSAHFS QPTGLRQQLA TETGLPLRVM PSNPSPNQFR PIPVR
|
| |