Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1453 |
Symbol | |
ID | 6375131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1575421 |
End bp | 1576827 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642683947 |
Product | transposase IS4 family protein |
Protein accession | YP_001959861 |
Protein GI | 189500391 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.243782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTCC CTTTCGCTAA TATACACACT GAACAGCCCA CCCAACAAGC GCTATTTCCT GATTGTTTCG AAGTATCCGT TGCCCCGGTC AAGGGCAAAA AAGTGGTTCT TGATTTCCAG GGCGGCAACA CCACCAGCGA TGCCGGTGTC CTGCTGTTGA AGGAAGTCGA GTCCATGACC AGGATCGTTC CGAAGCTTGC CGATTGCATT GCCGATTCGC GTCGGACCTC GTCTGTCATG CATGTGATCC ATGACCTGAT CGCCCAGCGG GTCTACCAGA TCGCCTGCGG CTATGAGGAC GGCAACGACA GCAATTCCCT GCGGAAGGAT CCCGCTCTCA AGATGGCCCT CAACCGTCTT CCTGAAAGCG GCGATGACCT TGCCAGCCAG CCGACCTTCA GCAGACTCGA AAACATGGTT ACCCGTCCGG AGCTCTATCG TATGGCTGTC GGGTTCCTCG ATCATTTCCT TGACTCCTAC ACCGAGGCGC CGCGGGTCAT CGTCCTGGAC TTTGACGATA CCGAGGATGT CGTTCACGGC AAACAGCAGC TGGCGCTTTT CAACGGCTAT CACCAGGAGA CCTGCTACCA GCCTCTCCAT GTTTTCGAGG GGTTGACTGG CAAGCTGATC GCCTCGATCC TTCGCCCTGG CAGGCGTCCT ACCGGCAAGG AGATTGTGTC ATACGTGAAG CGTATTATCC GCCATATCCG GAGCCGGTGG CCGGAAACGA TCATCGTCTA CCGCGGCGAC AGCCATTACG GCGTGCCAGA AGTCTACTCC TTCCTTGCTT CAAAGCGGAA CTGCTACAGC GTGACCGGCC TCGGCGGTAA TGACGTGCTG CTCCGCTCCG TCAAGGACAT TATTGAGGAG GTCAAGAAGC ATGGAGCCGG ATACCGCCGT TACCATACCT TCCAGTATCA GGCACGGAGC TGGAATGGGA GCCGCAGAGT GGTCGCCAAG GTCGAGATGA CCGAAAAGGG GTTGAACGTG CGCTTCATCA GCACCGACAT ACAGGAGGCA AAGGCCAAGA CTCTGTACGA GCAAATTTAC AGCGCTCGTG GCAACGATGA ACTCTACATC AAGGCGCATA AAACGTTCAT GAAGAGCGAC CGGACCTCGT GCCATCGCTT TCTTGCCAAC CAGTTCAGGG TCTTCCTGCA TTCGGCGGCC TATGTCCTGG TCCACGCCTT CCAGACCAAC CTGCTCCGGG GCACCGCCCT TGCCACGGCG ACTTTCGAAA CGATCCGCTT GAAGCTGCTG AAAATCGGGG CGAAAGTCAT CGAGATGAAG ACACGCATCA AGGTGCATCT GCCGACCTCA TATCCGTACA AACCGATACT GAACAAGTGC CTCACCATCC TTGAGCACCT GCGATCAGTC CCATGGCCAT CAACAGCAAT TCCGTAA
|
Protein sequence | MQLPFANIHT EQPTQQALFP DCFEVSVAPV KGKKVVLDFQ GGNTTSDAGV LLLKEVESMT RIVPKLADCI ADSRRTSSVM HVIHDLIAQR VYQIACGYED GNDSNSLRKD PALKMALNRL PESGDDLASQ PTFSRLENMV TRPELYRMAV GFLDHFLDSY TEAPRVIVLD FDDTEDVVHG KQQLALFNGY HQETCYQPLH VFEGLTGKLI ASILRPGRRP TGKEIVSYVK RIIRHIRSRW PETIIVYRGD SHYGVPEVYS FLASKRNCYS VTGLGGNDVL LRSVKDIIEE VKKHGAGYRR YHTFQYQARS WNGSRRVVAK VEMTEKGLNV RFISTDIQEA KAKTLYEQIY SARGNDELYI KAHKTFMKSD RTSCHRFLAN QFRVFLHSAA YVLVHAFQTN LLRGTALATA TFETIRLKLL KIGAKVIEMK TRIKVHLPTS YPYKPILNKC LTILEHLRSV PWPSTAIP
|
| |