Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5032 |
Symbol | |
ID | 5736991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 44690 |
End bp | 46225 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282199 |
Product | hypothetical protein |
Protein accession | YP_001547790 |
Protein GI | 159901544 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0989917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTAT ACCCTCAGTC CATTCTGCCC GTTCCTGCGT CCACCGCCCA CGCACTCCAT GCCGCGTTTC CCAACGGCAA TCGTTATATC GATCTCCGCG CAGAATTTGG GACGCTCTTT ACTGATGACC ATTTTTGTGC GTTGTATCCG GCGACTGGTC GCCCCGTGGT GGTTGCCCCC TGGCGCTTAG CCCTCGTCTG TGTCCTCCAA TTTATGGAAG GACTGACCGA TCGCCAAGCA GCGGATGCCG TTCGTCGGTG TATGGACTGG AAGTACGTGC TCAGTCTTGA CTTGACCGAT CCGGGCTTTG ATTTCACCGT GCTGCATGAT TTCCGTGAAC GCATCATTGC CGCTGATGCA ACACACGACC TCTTGACCCG CTTCCTTACG GCTTGTCAAG CCCGTGGCTT GATTAAAACG CGTGGGACGC ATCGCACGGA CGCAACCCAT ATGTTCGCAT CCGTGCGCAC CCTCCATCAG ATCGAATGTG TCCTTGAAAC CATGCATTGG GTGCTCAATA CCATGCCGGA TCATGATCCG GCATGGGTGG AGGCACATGT CCCACCCGCA TGGTTTGAGC GGTATGGCCT GCGGGCTGAT CGCATGCGGT TTCCAAAGGA CACCAGTAAA CGCACGGCAT TGGCAACGAC CATTGGTCAG GATGGAGCAA CCCTGCTTGA TTGGCTTGCC CAGCCAGCAA CGCCCCATGT TCTGCGCGAT CTTCACTGCA TCGCCGTCAT GCGCATGATT TGGATGCAAC AATTTTATCG CTGTACGATT CCTGGTGCCG AGCAGCTCCG GCTGCGTACC ATGGATGAAA AGCCCGCAAC CGCGCACCTG ATCCAATCAC CCTATGATGT TGAGGCTCGC TATAGCAGTA AACGGGACAC CGTGTGGAAT GGCTACAAAG TGCATCTCAC CGAAAGCTGT GATGATGGGT ATCCCGATCT GATTGTGCAT GTGGCGACGA CGAGCGCGAC GACGCAAGAC TTCCGCATGG GTGCATCCAT TCAGGATGCG GTTGCCGCAC ACGGCCTCAC GCCAGCGATC CATTTGATGG ACGGCGGGTA TGTGGATTCC CACCTGCTGG TCCATGCGCA ACAGTATGAC ACGATGGTGA TCGGGCCAAC CTTTGGCTCA TATAGTCGTC AGCGACGCGA GGATCACGGA TTTGCCCTTG CCGCCTTTGC GATTCATTGG GAGGCGCAAA CCGTTCAATG CCCACAAGGC CAGATGAGCG TGAAATGGAC ACCAGGCCAG ACCACCCACC ATGGCAAAGC GGCCTGTCGT GCCTGTTCGG TGCGGGCACA CTGTACGGCA GCGAAGGATG AGCCACGCCA ACGAACCCTC CGTCCCCAGC AACAACATCA TGCCTTATAC GATGCCCGTG CGCGTGAGCA AACAGCGGTA TTCAAACACC AGTTGCGGGC AGGGGTTGAA AGTACGATGG CCCAAGGAGC GCTTCGCTTT GGGATACGCC GGAGCCGCTG GGATGGACTT GGCCAATGCA TATGGGTTCA TGCGGTCTTT TGGTGA
|
Protein sequence | MTLYPQSILP VPASTAHALH AAFPNGNRYI DLRAEFGTLF TDDHFCALYP ATGRPVVVAP WRLALVCVLQ FMEGLTDRQA ADAVRRCMDW KYVLSLDLTD PGFDFTVLHD FRERIIAADA THDLLTRFLT ACQARGLIKT RGTHRTDATH MFASVRTLHQ IECVLETMHW VLNTMPDHDP AWVEAHVPPA WFERYGLRAD RMRFPKDTSK RTALATTIGQ DGATLLDWLA QPATPHVLRD LHCIAVMRMI WMQQFYRCTI PGAEQLRLRT MDEKPATAHL IQSPYDVEAR YSSKRDTVWN GYKVHLTESC DDGYPDLIVH VATTSATTQD FRMGASIQDA VAAHGLTPAI HLMDGGYVDS HLLVHAQQYD TMVIGPTFGS YSRQRREDHG FALAAFAIHW EAQTVQCPQG QMSVKWTPGQ TTHHGKAACR ACSVRAHCTA AKDEPRQRTL RPQQQHHALY DARAREQTAV FKHQLRAGVE STMAQGALRF GIRRSRWDGL GQCIWVHAVF W
|
| |