Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3953 |
Symbol | |
ID | 5735814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4958928 |
End bp | 4960721 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641281103 |
Product | Tn7-like transposition protein D |
Protein accession | YP_001546713 |
Protein GI | 159900466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTCG CCTCAGCTAA CATTGCTGTT GAAGAACTTT TTGGCACACG AAATATTGTA GCTGTTGTTG ATCTTCCTAG CCGACTAGAT ATTTTATCCA ATCGGCTACC CAAGGCATTA GGTCTTAATT CGGAACTACT TATCAGCAAT CACACGTCAT TGCCTTTGTT TGCGCCTTTT TTATCTGGTG ATGCTCTCCA CAGAGTCCGA CGTTCAATGC TCGGCAATAA AGGGGGAACA GTACATTCGC GCATTGGTAT TATGGCTAGT GGAGTCAAGA GCCTCAGAAA GCTGCGTTTT TGTCCAGTGT GTGTCAATGA AGATCGGTCA AAATATGGAG AATGTTATTG GCATCGGGTT CATAATATCG CAGGAATCAA TGTTTGTGCC CATCACGGAT GCATTTTACA AGATAGCATT CTTTCATTTC AACGACAGAG CACCTATTAC GCCTACAACG CAGCAGAAGA TTACATACCA GAGATAGAGT TTGCAGATCC GACTCCAGCA AATCTGCATC TCTATGCTCT TGCCAAGCAA GCCTATTGGC TCTTAAACAA TATATCTAAT GCAGATCTGA AGGATATGTA TAATGTCTAT CGAAGATGCC TTGATTTAAG AGGGCTTACG ACGGCATCCA ATAGCATTCG TCAGCAGCAA CTTCAATCCA CATTCATAGA TTTCTACACG CCTGAATTGC TTGATACTAT ATCTTGCTCT TTAGAAGATG TTCATCAAAC ATGGTTATCC CGCCTAGTTG GCGGAGATTA CGTAGCACAT CATCCTCTTC GTCATCTTCT GGTACTTCAT TTCTTTGGCC ACACGCCGGA GAGCGCATTG AAGAGTCTTG ATGAACCTAC GCCCGATCCA TTCGGTCACG GCCCTTGGCC ATGTCTTAAT CGAGTTTGTC CACATTCACA CAAAGATGTT ATTGAAACAT GTAATGTAAG GCGTAGTCGG TATACCCGTG ATCGGCCAAT TGGGACATTT GCTTGTCCAT TTTGTGGATT CAGCTATATA CGGACAGGGC CGGATCAATC TCAGAACTCG CGATATCATA TTAGTAAAAT CCAGGAATTT GGCTTCATAT GGGAAAAGAG ACTGGTTGAA CTGTTCGCTC AACAAATCCT GAGCCTACGT GCTATAAGCA AGATCTTGGG CGTTGATCCC AATACTGTCA AACGTCATGC ACAACGACTT ATGGATACTG GTGATAAAAC CTTAAATAAA GACCTATCTA ATACTAATGC TAGCTTAGAT GCGACTGATC TTCTAATATC ACATCGACAC ACATGGCTTA AAGCGATTAA AGTTTTTCCT CAATTTGGTG TATCCCAATT GAGAAAACGC TTTAGCACGG CCTATGCTTG GCTTTACCGA AATGACAAAG CTTGGCTTTT TCTCCATACC CCAGCCAGAT CTAGTCGGAT TTTCCGTACG AATCGAGTAT CATGGCCAGC CCGGGACAAA TTGCTACTAC AAGCGGTGCA AGAGGCAGCA ACCAGAATTT TAAACCAGGA ACAATGGCCT AAACAAATGA CCTTAGGTGC TATTGGAAGA GAGGCAGCGT GTCTCCCGAT CCTACAGCGA CATTTAGATA AGCTGCCGAA AACGGCAGAT CGTCTGAAAG ATTTGGTGGA AACGAGAGAG GCGTGGGCTG TAAGGCGGCT CCAATGGACT ATTAACCAAG CACAAAATCA AGGACTCATT CTCAAGGGCT GGGAAATTGT TCAACGCTCA GGCTTAGGAC GTTTCCCCAA GAAATGGTTT GAACATCATG GTATACAGTT TTAA
|
Protein sequence | MRFASANIAV EELFGTRNIV AVVDLPSRLD ILSNRLPKAL GLNSELLISN HTSLPLFAPF LSGDALHRVR RSMLGNKGGT VHSRIGIMAS GVKSLRKLRF CPVCVNEDRS KYGECYWHRV HNIAGINVCA HHGCILQDSI LSFQRQSTYY AYNAAEDYIP EIEFADPTPA NLHLYALAKQ AYWLLNNISN ADLKDMYNVY RRCLDLRGLT TASNSIRQQQ LQSTFIDFYT PELLDTISCS LEDVHQTWLS RLVGGDYVAH HPLRHLLVLH FFGHTPESAL KSLDEPTPDP FGHGPWPCLN RVCPHSHKDV IETCNVRRSR YTRDRPIGTF ACPFCGFSYI RTGPDQSQNS RYHISKIQEF GFIWEKRLVE LFAQQILSLR AISKILGVDP NTVKRHAQRL MDTGDKTLNK DLSNTNASLD ATDLLISHRH TWLKAIKVFP QFGVSQLRKR FSTAYAWLYR NDKAWLFLHT PARSSRIFRT NRVSWPARDK LLLQAVQEAA TRILNQEQWP KQMTLGAIGR EAACLPILQR HLDKLPKTAD RLKDLVETRE AWAVRRLQWT INQAQNQGLI LKGWEIVQRS GLGRFPKKWF EHHGIQF
|
| |