Gene Information |
Locus tag | Haur_5103 |
Symbol | |
ID | 5737061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 135905 |
End bp | 137149 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641282268 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001547859 |
Protein GI | 159901613 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
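The coordinate and length fields above are consistent with each other, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Haur_5103).
length = gene_length_bp(135905, 137149)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1245 414
```

The strand field does not affect this arithmetic: a gene on the minus strand spans the same coordinates, only read in the reverse-complement direction.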
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.116678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCATC CTAGCACACC ACCGACCGTC ATTGCAACCC TTGACATCGC CAAACATACG CATTGGTTCG CTGTCTTTGC GCCCGATCTC ACCCCGATCA TTCCACCCCA CCCCATCACC ACCGATGCGA CTGCCTTGCA ATCGGTGATC ACCACCCTCG CCCAGCTTGC GCTTGCTGGC CCGGTCGCGC TCGCCATGGA GCCAACCAGT ATCTATCATC TCCCCTGGCT GCATGCCCTC ACGGCGGCAC TCCCGCCCAC CGTCACCTGC CTGCTCGTCC ACACCACCGC CGTGCACCAT GCCCGAACCC GCCTGACGGC AGGTCGGCTC CGCAAAACCG ATGCCCGCGA TTGCCATGCC ATTGCCGCTG CCGTGCGCGA TGGTCATGGC CGCCCGTGGT CGCCGCCCTC CCCCCAGCAG GCCCAATTCC GCACATGGGC CGCCCAAGAA GCCGCCACCA TGGAGACCCT TACCCAGCTC GCCCACGCCC TCCAGCGCCT GACCGATCTG CTCTGGCCCG GCTTGGTCGC TCGCCGCAAC GCCGCCAGCA CACCGTTGGT CTCCTCGCGC CTATGGACGC GCCACATCAT CCAGACCATC CTGCTCCACC ACCCCGATCC CCATACCTGG CGGTCGCTCT CGGTCGCCGC GATCCGCGCC CGCCTGAAAG CGCTCGGCAT GCGCTGTGGG ATCGGTCGCG CCACCCACCT CGCCGCCATT CTTGCCGCAC AGGTCGTGTT GCCACCGGAG CAAACGCCAC CCTTAGCGGC TCGCCTGACC ACCCTGATGC AGCAGTATTG TCACCACGCC ACCCACCTCG CCGCCCTGCA TGCCGAGGCC GAAACCCTCG TTGCTGCCTC ATGGGCAGCG GTCTTGTGCT CCATCCCGGG CATGAGTCCA GTGCTGGCGG CGCGCTATGC GGCGGCGGTC GGCGATATCC ACACAATCAC CTCGGCCAAA GCCCTCTGGT CGCTGGCGGG ACTCGAACCA AGCATCTATG CCAGTGGCGT GCGATCCCGC GTTGGGCAAT CCTCGTTGGC TGGCCGGATC ACGCTGCGGC AGGCCTTGAT CCGCATTGGC GCAAGCCTCA GTCGCCATTG TCCACCTGTG CGGGCCTGCT GGCTCGCGGC GCGGGCGCGG CGCAAACCGC TCGCTGTTGC GCTGATTCAT GCCGCGAACA AGGCCAATCG GCTGCTGTTT GCACTGGCGA TCAGTCAACA ACCCTATCAG CCCGGCAGGG CGTGA
|
Protein sequence | MSHPSTPPTV IATLDIAKHT HWFAVFAPDL TPIIPPHPIT TDATALQSVI TTLAQLALAG PVALAMEPTS IYHLPWLHAL TAALPPTVTC LLVHTTAVHH ARTRLTAGRL RKTDARDCHA IAAAVRDGHG RPWSPPSPQQ AQFRTWAAQE AATMETLTQL AHALQRLTDL LWPGLVARRN AASTPLVSSR LWTRHIIQTI LLHHPDPHTW RSLSVAAIRA RLKALGMRCG IGRATHLAAI LAAQVVLPPE QTPPLAARLT TLMQQYCHHA THLAALHAEA ETLVAASWAA VLCSIPGMSP VLAARYAAAV GDIHTITSAK ALWSLAGLEP SIYASGVRSR VGQSSLAGRI TLRQALIRIG ASLSRHCPPV RACWLAARAR RKPLAVALIH AANKANRLLF ALAISQQPYQ PGRA
|
| |
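The gene sequence above is displayed in space-separated blocks of ten bases and, although the gene lies on the minus strand, it is already shown in coding orientation (it begins with ATG, ends with TGA, and its first codons translate to the MSHPS start of the protein sequence). A minimal sketch of how such a field could be cleaned and sanity-checked before further use is given below. The helper names are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start (simplified), stop codon at the end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy usage with the first two blocks of the gene sequence field.
fragment = clean_sequence("ATGAGCCATC CTAGCACACC")
print(len(fragment))                 # 20
print(gc_fraction("ATGAGCCATC"))     # 0.5
```

Applied to the full gene sequence field, `gc_fraction` can be compared with the GC content row and `len` with the gene length row.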