Gene Information |
Locus tag | Haur_3939 |
Symbol | |
ID | 5735800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4934549 |
End bp | 4935754 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641281090 |
Product | transposase IS4 family protein |
Protein accession | YP_001546701 |
Protein GI | 159900454 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
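The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(4934549, 4935754)
print(length, protein_length_aa(length))  # prints: 1206 401
```

Both results agree with the Gene Length (1206 bp) and Protein Length (401 aa) rows.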
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCAAC CCTATCCGTT AACCAACCGA GGTCGTATGC CGAGTATACC AGCCCTTGCC CAGTGGATCA ACACCATCCT GCTGACCGCC GTGCCTACCC TCTCGCCCTG GACGGCCCGG CGGCTCACCG ATTGGCTGCT CAGCATCCTG CTCATGCCGT CCATCACCAC CCGCGTCGTG GCCTGGGGCT GTGCCCTTGG ACTGTCCACC GCTGCCCGCG CCGCCAGTCA CGAACGGCGA TTGCGCCGCA CCTATCGGGA TTCCCAGCTG TCGTGGTCGC TCCATCGGGC CATCCTGACC ACCACCCTGC GCTTCGCCCC TGCCGAAGCG GTCACCGTCA TCATTGATGA GACCACGCAT ACTGACCACT GGACGCTCCT CACCGCCGCC CTCTGGTATC ACGGGCGTGC CATTCCGCTT GCTTGGGTGC TCCATCCCGG CAATACCCGC CGTACCACCG CCTTCTGGAT GGATACCGCC ACCCTACTCG ATCGCGTGCA GCAGGTATTG CCAGCCACCA CGGACGTGGT TGTGGTGGCC GACCGTGCCT TTGGTTGTCC CGCCTTTACC GATCAGGTGG CGGCCCACGG CTGGGGCTGG GTCGTGCGCG TCCAAGGCCA CACCCGTATG CTCCTACGGG GCCATACCGA AGTCCCGATT CGCACGCTGG TGGGTCGGGG CCAGCGGGTG GTGCGGCGGG GGCAGGCCTT CAAGAAGGCG GGCTGGCGGA CGGTGACGGT GGTCGCGGAG TGGACAGCGG TGGCGCAGGA ACCGTTGCTG CTGGTCAGCA CTCTCGCGGG GATTGGGGCG ATCCGCCACG CCTATGGGCG GCGCTCTGCG ATCGAAGCGC TGTTTCGCGA TTGGAAAACG GGCGGCTGGC AATGGGAGGC GAGCCAGTCG CGGAGCCAGA CGACGCAGGA GGCCTTGGTG CTGGGCATGG CGATCGCGAC GGTGCTGGTG CTGCTGGTCG GGACGGCGGA GGCGCAGGCG GTGCTGGCCG AACGCGGGGA TCGCCCCAGC CCGCGCCGCC CATGGGCGGC ACGAGAAAGT CTGTTTCGGT TGGGGCGGTA TGGGGTGCTG CGCTGGCTGT GGACGGGAAC GCAGCCAGCG CTGGGAGCGC GACTGTCGTT GGCGGAAACG GCGCTGCACG AACGGTGGGC CACGACGGTG ACGCGGGGTG GTCGGCTCGG GACGGCCATC CCCTAA
|
Protein sequence | MVQPYPLTNR GRMPSIPALA QWINTILLTA VPTLSPWTAR RLTDWLLSIL LMPSITTRVV AWGCALGLST AARAASHERR LRRTYRDSQL SWSLHRAILT TTLRFAPAEA VTVIIDETTH TDHWTLLTAA LWYHGRAIPL AWVLHPGNTR RTTAFWMDTA TLLDRVQQVL PATTDVVVVA DRAFGCPAFT DQVAAHGWGW VVRVQGHTRM LLRGHTEVPI RTLVGRGQRV VRRGQAFKKA GWRTVTVVAE WTAVAQEPLL LVSTLAGIGA IRHAYGRRSA IEALFRDWKT GGWQWEASQS RSQTTQEALV LGMAIATVLV LLVGTAEAQA VLAERGDRPS PRRPWAARES LFRLGRYGVL RWLWTGTQPA LGARLSLAET ALHERWATTV TRGGRLGTAI P
|
| |
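The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content row. The helper names are hypothetical, and the example string holds only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, for illustration only.
example = "ATGGTGCAAC CCTATCCGTT"
print(clean_sequence(example))
print(gc_fraction(example))
```

Passing the complete Gene sequence value to `gc_fraction` would give the figure to compare against the reported GC content.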