Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_3814 |
Symbol | |
ID | 6198005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010580 |
Strand | + |
Start bp | 124252 |
End bp | 125901 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641703946 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_001831098 |
Protein GI | 182676951 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAGA AGATCCATTT GGCCTTCGGG CTTGGCATTG CTGTCGAAAA GATCGACCAT CGCGAGGAAG GCTGGGTGGT TTCAGCTCTC GCGTCTGGAA CCCGCAGCTG TCCCGGCTGC GGTGTGGTCT CAACCAGGCG GCACAGTTGG CACGTCAGAC ACCTGCAGGA TCTGCCGATC CAGGGGATCC CTGTGACCAT CGCGCTGCAG CTGGGACGCT GGCGCTGCCG CAACGAGAGC TGCTCGCGCA AGACTTTCGT CGAGAAGATC TCGACCGCCT TTCCGTTTGC CCGACGGACC GCACGGGTCG GTGAGATCAT TCGCCTCTTT GGCCATGCCG CGGGAGGCCG GGTTGGTGCA AGACTGTTGG ACCGCCTTGC CATGCCAACC AGCCATAACA CGGTCCTGAG GCATCTCAAA CGGCATGCTT CGGCGAGCAA GCTCAAGGCT CCCCTTCGGA TTGCCGCTAT CGATGACTGG AGCTGGCGGC GGGGCGAAAC CTACGGCACG ATCATCGTCG ATCTCGAAAG GCGGACCGTT GTCGATGTCT TGCCCGTTCG TTCCGTCGAG AGCACGGAGC ACTGGCTCAG GCAGCATCCT GGCATCGAGA TCGTCAGTCG GGATCGGTGT GGCCTCTATG CCCAGGCCAT TCGGCAGGGC GCGCCCCAGG CCCAGCAGGT GACCGACCGG TTTCATCTGC TGCAGAACCT GCGGGAGGCC ATCGAGCGCC AGATGGAACG GGTCAGCCGG TTTGCCGGTC GCTCTCTTCT GCCAGCGGGT TCTGACGCCA AACGGGAAGC ACCCCGGCAG GCAAGCCGGG AAGCCCGATT GGCGTTATTT CAAAATGTTC ACGAGCTACA CTCGGCCGGA ATGCCTATCA CGGCGATCAA GGACAAGACC GGGCTTGCCC TGCACACGCT GCGCCAATGG GTCCGTCTGG ATGACTTGCC GGCGCGCCGT CACCCCGCTC CGACCGCCAG ATCACCGGCT TCTTTCAAGG ATTTCCTGAA GCAGCAATGG GAGGCTGGAA ACCGATGTGG ACGCCATCTT CTGCATGATC TCCGCCATCG CGGCTATACG GGCAGCCGCT CTCACCTCTA TCACTTCATC GCGGAATGGC GGCGGCTTGA GCCGGATGAG AGCAGGAACA TCAAGGCGTC ACCAACACCA CATACGCCAC TGGCGGAAAC GAAAGCAATT GACCCAGTGA CGGGCTGGCA GATCTCGCCG AAAGTCGCCG CAGTGCTGTG CCTGAAGCCG ACGCGCTTGC TGACACCCCG TCAAGCACTC AAGGTCAAAG CCCTGAAACA GGCCTCTCCA AGCTTCGTCA CCATGCGGGC TCTAGCGATG CGGTTTCGTG GTCTTATGCG CAGCAAGGAG CCATCAAAGC TCGAAAAATG GCTCGAGAAG GTAAGACATG CCGCCATTCT CCCTCTGCAG CAATTTGCCA AAACGCTGAG GCGTGATCTC GCCGCTGTCC GAAACGCTAT TACTCAGCCT TGGAGCAGTG GGCAAGCGGA AGGACAGATC AACCGCTTGA AAACACTCAA GCGGACGATG TACGGAAGAG CTGGCAATGA GCTGCTCCGC GCTCGGATGA TGCCGTTTGA TTTCGTAAAT GAAACTGTGA ATGGAATGCC TGATCCTTGA
|
Protein sequence | MRKKIHLAFG LGIAVEKIDH REEGWVVSAL ASGTRSCPGC GVVSTRRHSW HVRHLQDLPI QGIPVTIALQ LGRWRCRNES CSRKTFVEKI STAFPFARRT ARVGEIIRLF GHAAGGRVGA RLLDRLAMPT SHNTVLRHLK RHASASKLKA PLRIAAIDDW SWRRGETYGT IIVDLERRTV VDVLPVRSVE STEHWLRQHP GIEIVSRDRC GLYAQAIRQG APQAQQVTDR FHLLQNLREA IERQMERVSR FAGRSLLPAG SDAKREAPRQ ASREARLALF QNVHELHSAG MPITAIKDKT GLALHTLRQW VRLDDLPARR HPAPTARSPA SFKDFLKQQW EAGNRCGRHL LHDLRHRGYT GSRSHLYHFI AEWRRLEPDE SRNIKASPTP HTPLAETKAI DPVTGWQISP KVAAVLCLKP TRLLTPRQAL KVKALKQASP SFVTMRALAM RFRGLMRSKE PSKLEKWLEK VRHAAILPLQ QFAKTLRRDL AAVRNAITQP WSSGQAEGQI NRLKTLKRTM YGRAGNELLR ARMMPFDFVN ETVNGMPDP
|
| |