Gene Information |
Locus tag | Cphamn1_1082 |
Symbol | |
ID | 6374756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1170690 |
End bp | 1172060 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642683583 |
Product | transposase IS4 family protein |
Protein accession | YP_001959501 |
Protein GI | 189500031 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
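The length fields above agree with each other if the coordinates are read as 1-based and inclusive. A minimal Python sketch of that check, assuming inclusive start and end positions and assuming the reported protein length excludes the stop codon (variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 1170690
end_bp = 1172060

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the sketch reproduces the 1371 bp gene length and 456 aa protein length listed in the table.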
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000105958 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.448539 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTCC CTTTCGCTAA TATACACACT GAACAGCCCA CCCAACAAGC GCTATTTCCT GATTGTTTCG AAGTATCCGT TGCCCCGGTC AAGGGCAAAA AAGTGGTTCT TGATTTCCAG GGCGGCAACA CCACCAGCGA TGCCGGTGTC CTGCTGTTGA AGGAAGTCGA GTCCATGACC AGGATCGTTC CGAAGCTTGC CGATTGCATT GCCGATTCGC GTCGGACCTC GTCTGTCATG CATGTGATCC ATGACCTGAT CGCCCAGCGG GTCTACCAGA TCGCCTGCGG CTATGAGGAC GGCAACGACA GCAATTCCCT GCGGAAGGAT CCCGCTCTCA AGATGGCCCT CAACCGTCTT CCTGAAAGCG GCGATGACCT TGCCAGCCAG CCGACCTTCA GCAGACTCGA AAACATGGTT ACCCGTCCGG AGCTCTATCG TATGGCTGTC GGGTTCCTCG ATCATTTCCT TGACTCCTAC ACCGAGGCGC CGCGGGTCAT CGTCCTGGAC TTTGACGATA CCGAGGATGT CGTTCACGGC AAACAGCAGC TGGCGCTTTT CAACGGCTAT CACCAGGAGA CCTGCTACCA GCCTCTCCAT GTTTTCGAGG GGTTGACTGG CAAGCTGATC GCCTCGATCC TTCGCCCTGG CAGGCGTCCT ACCGGCAAGG AGATTGTGTC ATACGTGAAG CGTATTATCC GCCATATCCG GAGCCGGTGG CCGGAAACGA TCATCGTCTA CCGCGGCGAC AGCCATTACG GCGTGCCAGA AGTCTACTCC TTCCTTGCTT CAAAGCGGAA CTGCTACAGC GTGACCGGCC TCGGCGGTAA TGACGTGCTG CTCCGCTCCG TCAAGGACAT TATTGAGGAG GTCAAGAAGC ATGGAGCCGG ATACCGCCGT TACCATACCT TCCAGTATCA GGCACGGAGC TGGAATGGGA GCCGCAGAGT GGTCGCCAAG GTCGAGATGA CCGAAAAGGG GTTGAACGTG CGCTTCATCA GCACCGACAT ACAGGAGGCA AAGGCCAAGA CTCTGTACGA GCAAATTTAC AGCGCTCGTG GCAACGATGA ACTCTACATC AAGGCGCATA AAACGTTCAT GAAGAGCGAC ACGGACCTCG TGCCATCGCT TTCTTGCCAA CCCAGTTCAG GGTCTTCCCT GCATTCGGCG GCCTATGTCC TGGTTCCACG CCTTCCCAGA CCAACCTGCT CCGGGGCACC GCCCTTGCCA CGGGCGACTT TCGAAACGAT CCGCTTGAAG CTGCTGAAAA TCGGGGGCGA AAGTCATCGA GATGAAGACA CGCATCAAGG TGCATCTGCC GACCTCATAT CCGTACAAAC CGATACTGAA CAAGTGCCTC ACCATCCTTG A
|
Protein sequence | MQLPFANIHT EQPTQQALFP DCFEVSVAPV KGKKVVLDFQ GGNTTSDAGV LLLKEVESMT RIVPKLADCI ADSRRTSSVM HVIHDLIAQR VYQIACGYED GNDSNSLRKD PALKMALNRL PESGDDLASQ PTFSRLENMV TRPELYRMAV GFLDHFLDSY TEAPRVIVLD FDDTEDVVHG KQQLALFNGY HQETCYQPLH VFEGLTGKLI ASILRPGRRP TGKEIVSYVK RIIRHIRSRW PETIIVYRGD SHYGVPEVYS FLASKRNCYS VTGLGGNDVL LRSVKDIIEE VKKHGAGYRR YHTFQYQARS WNGSRRVVAK VEMTEKGLNV RFISTDIQEA KAKTLYEQIY SARGNDELYI KAHKTFMKSD TDLVPSLSCQ PSSGSSLHSA AYVLVPRLPR PTCSGAPPLP RATFETIRLK LLKIGGESHR DEDTHQGASA DLISVQTDTE QVPHHP
|
| |
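The protein sequence can be reproduced from the gene sequence using translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the two tables differ only in which codons may act as starts). A minimal Python sketch, assuming the spaces in the listing are display grouping only; the `translate` helper is hypothetical and only the first 30 bases of the gene are used here:

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base display grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
prefix = "ATGCAACTCC CTTTCGCTAA TATACACACT"
print(translate(prefix))  # MQLPFANIHT
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should yield the full protein, provided the listing is complete and in frame.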