Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4461 |
Symbol | |
ID | 5736312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5704057 |
End bp | 5706006 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281624 |
Product | hypothetical protein |
Protein accession | YP_001547221 |
Protein GI | 159900974 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.608634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAC AAGGTGCGTT CACGTTTGTC TTACATAGCC ACTTGCCCTA TTGTCGCAAG GCTGGCCGCT GGCCTCACGG TGAAGAATGG ATTCACGAGG CCGCTTCCGA GACGTATATT CCACTACTCA ATGCGCTCAA CGATCTGATC AACGATGGGG TTACACCACG CTTAACGATT GGGATTACGC CAATTCTAAC CGAGCAGCTT GCTGACCCCA CCATTTTGCA CAATTTTGAA GAATATCTTG ACGAGAAGAT CACTGCGGCG CAAGCTGATA TGGATCGACT GGCCGACGTT CAGGCAGTTT GGGATGCTGC CCAAGTTGCC GAACCTGAAA CCGAGCCAAC TCCCCTGCTC TCCAGCGAAG AGTTGGAAAG CTTGATCAGC AAAAGCGATG CGCTGCTTTC TTCAACTGCT GGTGATGCTC CGGCCCCACT TCACGCTGGT TTGCTGAGCG CCACTGCTGC TGCCAGCACC GAAGCCGAAG CAGATGATGA AGCGGAATCT GAAGAAACCG AGGAAGCAGC AGTTGAAGAA CCTGCTCCGA TCGAGCAGCC TGATCCACAT AAAGCCTATT TGGCCATGTG GTATCGCGAT TGGTACAGCA TGATCAAGCG TTCGTTTATC GAGCGCTACA ACCGCGATAT TGTCGGAGCT TTCCGCCAAT TGCAAGATGC TGGCTATATC GAAATTATCA CCTGTGGCGC GACCCACGGC TACTTGCCCT TGGTCAGCCG CGATTCAACA ATTTATGCCC AAATTGCCGT CGCTGTCCAA AGCTACGAAC GTCATTATGG CCGCAAGCCA AAGGCGATTT GGCTGCCCGA GTGCGCCTAT CGCCCAGCCT ATTATCCTGA AAACCCCAGC GAAACCGAGC GCAAGCCTGG CATCGAAGAA TTTCTTGAAG CCCAAGGCAT CGAGTGCTTC TTCGTCGAAA CCACCACCAT CGAGGGTGGC GCACCCATGG ATAAGGCTGA AGGCAAGATT CTTGGGCCAT ATGGCGATAC GTTGCGCCGC TATGTCGTGC CAGTCAGCCG CGAAATTCCG CCAACTGGCA ATAGCACACT CCAACCCTAC TTAGTTGGTT TGAGCGATAA AGTTGCGGCA ATTGGCCGCC ATCACAAAAC TGGCTTACAG GTGTGGTCGG CTGAATGGGG CTATCCAGGC GAGGCTAACT ACCGCGAGTT CCACCGCAAA GATAGCGAAA GCGGCATGCA ATATTGGCGG ATCACTGGGC CAAAAGTTGA CCTTGGTTAT AAAGATTACT ATCATCCCGA TTGGGTCAAC GATAAAGTTA ATGCTCACGC CGAGCACTTC ACGGGCTTGG TACAGCAGGT TATCAGCGAA TATCGCGGCC AAACTGGGCG CTATGGCCTG ATTTCATCAA ACTACGATAC CGAATTATTT GGTCACTGGT GGTTTGAAGG GGTCGATTGG ATGCGCGAAG TGCTACGGCG CTTGGCGACA AATCCCGACA TTGATCTCAC CACGGCCTCG GAATATATCG CCAGCAACCC ACCGCGTGAA TCGTTGAACC TGCCCGAAAG TTCGTGGGGT TCCAATGGTA CACACCAAAC CTGGCTCAAC CCTGAAACCG AGTGGATGTG GCCAATTATT CATGCCGCCG AAAAGCGCAT GGAAGGCTTG GTCGCCAGCT ATCCACAGGC AGATGGCGCT TTAGCCGAAG CCTTGGCTCA AACTGCGCGT GAGTTGCTCT TGCTGCAATC CAGCGATTGG CCGTTCTTGG TCACGACTGG GCAAGCCCAA GATTACGCCA CCAAGCGTTT CAACGAGCAT GTCGATCGCT ACAATCAATT GGCTGATGCA ATTGAGGCCA ATGATGTTGG CTTGATGGCT GAACTAACAG CCAGTTTCAA CGAGCTTGAT AATCCATTCC CCACGATTGA TTATCATGTC TTTGCCGCTC GCGAAGGCTC AGCAGCCTAA
|
Protein sequence | MPKQGAFTFV LHSHLPYCRK AGRWPHGEEW IHEAASETYI PLLNALNDLI NDGVTPRLTI GITPILTEQL ADPTILHNFE EYLDEKITAA QADMDRLADV QAVWDAAQVA EPETEPTPLL SSEELESLIS KSDALLSSTA GDAPAPLHAG LLSATAAAST EAEADDEAES EETEEAAVEE PAPIEQPDPH KAYLAMWYRD WYSMIKRSFI ERYNRDIVGA FRQLQDAGYI EIITCGATHG YLPLVSRDST IYAQIAVAVQ SYERHYGRKP KAIWLPECAY RPAYYPENPS ETERKPGIEE FLEAQGIECF FVETTTIEGG APMDKAEGKI LGPYGDTLRR YVVPVSREIP PTGNSTLQPY LVGLSDKVAA IGRHHKTGLQ VWSAEWGYPG EANYREFHRK DSESGMQYWR ITGPKVDLGY KDYYHPDWVN DKVNAHAEHF TGLVQQVISE YRGQTGRYGL ISSNYDTELF GHWWFEGVDW MREVLRRLAT NPDIDLTTAS EYIASNPPRE SLNLPESSWG SNGTHQTWLN PETEWMWPII HAAEKRMEGL VASYPQADGA LAEALAQTAR ELLLLQSSDW PFLVTTGQAQ DYATKRFNEH VDRYNQLADA IEANDVGLMA ELTASFNELD NPFPTIDYHV FAAREGSAA
|
| |