Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5042 |
Symbol | |
ID | 5737001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 56779 |
End bp | 57903 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641282209 |
Product | putative transposase |
Protein accession | YP_001547800 |
Protein GI | 159901554 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACGC TCAACGCCAT CGTCCAACGC TCTGGTGCGG CGTATCGCGC CCAGTGTGGG ACACGGCTCT CCGATCACCA GCGCCGCGTT ATGCACGCGA TCGAAGCCTG TCGCACTGAG GCGCTCGGCG GCCAAGTCTT CACGTGTCCT GCCTGTCAGA CCCTCCGCTA TAGCTACCAT TCGTGCCGCA ATCGCCATTG CCCCACCTGT CAACAGGATG CTGGCGCGGC GTGGCTGACG GATCAGCAGG CATTATTACT GGCGGTGCCG TATTTCCTTG TCACGTTTAC GATCCCTGCC GAGCTGCGTC CGGTCGCCCG CGCGAATCAA GCCCAAGTCT ATGCCGCCAT GTTTCGGGCC TCAGCAGCGG CGCTCCAGCA GGTCGCCGCT GATCCGCGCC ATCTTGGTGG ACAGTTAGGC ATGCTGGGGG TCTTGCAGAC CTGGACCCGT GATCTCCGCT ACCATCCGCA CATCCACTAT CTGATTCCTG GAGTTGGTCG CACCGCCACG GGACAACTCG TGTTCCCGCC AGCGGAGAAT TTCCTCCTGC CGATGCGACC ATTGGCAGTC CTCTTCCGCG CCAAACTGCG GGCTGCTCTG CGCCAAATAC CCCATGGTAC CGTCATCCCA GCAGCAGTGT GGGAGCACGA CTGGGTGATT GACTGTCGCC CCGTCGGCAC CGGCGAAACG GCCTTGAAGT ATTTGGCACC GTATATCTTC CGCGTGGCGC TCAGTAATAA CCGCCTGCTC AGCGCCGATG ATGACCAGGT CACCTTTCGC TATCGTCACA GCGACAGTGG CGAGAACCGG ACGAGCACGC TCCCGGTGAA CACCTTCCTT GACCGCTTTA TGGCCCATAT CCTGCCCAAA GGGTTCGTCA AAGTGCGCTA TTACGGGTTT TTCCGACCAG CGGTGCGTGC CGACCTGCGG CGCATCCAAG CCCAACTGTG GTTACTCCGC AGACACGGTC AGTTTGACCC AGGCATCCCC CAGCCAGCTG CGCTGGCGCG TGGAGCGCAG ATCACGCCAT GTCCGGTCTG TGGCATTCAG ATGCAGGGTC GGCATCTCGC GCCCCGCTCG GCACGTGCGC CACCACAGAG CGTGAACGCT CCTGGCCTAG CGTGA
|
Protein sequence | MITLNAIVQR SGAAYRAQCG TRLSDHQRRV MHAIEACRTE ALGGQVFTCP ACQTLRYSYH SCRNRHCPTC QQDAGAAWLT DQQALLLAVP YFLVTFTIPA ELRPVARANQ AQVYAAMFRA SAAALQQVAA DPRHLGGQLG MLGVLQTWTR DLRYHPHIHY LIPGVGRTAT GQLVFPPAEN FLLPMRPLAV LFRAKLRAAL RQIPHGTVIP AAVWEHDWVI DCRPVGTGET ALKYLAPYIF RVALSNNRLL SADDDQVTFR YRHSDSGENR TSTLPVNTFL DRFMAHILPK GFVKVRYYGF FRPAVRADLR RIQAQLWLLR RHGQFDPGIP QPAALARGAQ ITPCPVCGIQ MQGRHLAPRS ARAPPQSVNA PGLA
|
| |