Gene Information |
Locus tag | Cphamn1_1921 |
Symbol | |
ID | 6375613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2084513 |
End bp | 2086009 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642684415 |
Product | transposase, IS5 family, putative |
Protein accession | YP_001960316 |
Protein GI | 189500846 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
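The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for locus tag Cphamn1_1921.
length = gene_length_bp(2084513, 2086009)
print(length)                     # 1497
print(protein_length_aa(length))  # 498
```

Both results agree with the Gene Length (1497 bp) and Protein Length (498 aa) fields in the record.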
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.2077 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCAAC AGAAGTTTCA GCAGCTCACG TTCGAAAATT TCCATCTGCC GTTCGGCGGT AAACTCGATC CGGAAAACCG GTGGGTGAGG CTCGCTGACG TAATTCCCTG GCACGTCGCC GAAACGATGT ATGCCAAAAA CTTCATGTCA AAACGGGGTG CTCCGGCACT GACGGTTCGT ATGGCGCTGG GGTCTTTGAT CATCAAGGAG AAACTCGGCC TTTCGGATAT CGAGACGGTC GAACAGATCA AGGAGAACCC GTATCTCCAG TACTTCATCG GTCTTGAATC GTTTCAGCAT ATCGCGCCCT TCGACGCCTC GATGCTGACC CACTTCCGGA AGAGGCTGAA GCATACCGAT CTCGGTGCGC TGCAGGAGGA ACTCCTGCAA CGCCATCTGG CTGAAGAGCG AAAGAGGGCT GAGGAGAAGA AAGAGAACCA GAATGGCGAC GACGATGGAA ACGAGGGTCC TGCCAATAAA GGCAAGCTCA TCGTCGATGC CACTTGTGCG CCAGCAGATA TCGCCTATCC GACAGATATC GGCTTGCTGA ACGATGCTCG GGAAAAGACC GAGCGGATCA TTGACGAACT GTATGCCATG CACCCGGAAG GAGTGAGCAA GCCGAGGACG TACCGTAAGA GGGCAAGGAA GGATTTTCTT GCCCTCGGCA TGAAAAAGAA GTTGTCGAAG AAAGCGCTCC GCAAGGGGCT TGGCAAGCAA CTGCGATACC TCAGGAGAAA TCTGGAGCAC ATTGCCATGC TATCTGGTTC AGTGCCGCTG ACGGTCCTGT CAGCACGTTG GCATCGGGAC CTGCTGGTGA TCGGTGAATT GTACCGGCAG CAGGTGGAAA TGTGGCAGAG CGGCAAGAAG AACATCAGCG ACCGCATCGT CAGCATCAGC CAGCCGCACG TCAGGCCGAT CGTACGGGGC AAGGCCGCAG CCAAGACAGA GTTTGGCATG AAGCTCTCGA TCAGCGTCGT CGACGGCATC AGCATGCCGG AGAGGATGAG CTGGAACGCC TACAACGAAG GCTGCGATCT GGTGCGAGAC ATCGAGCGGT ACCGTGAGCG ATACGGCCAC TATCCTGAAT CGGTCCATGC CGACAAGATC TACCGGACGC TGGCCAACCG GATGTGGTGC AAGGCTCGAG GGATCCGGCT GAGCGGCGTG CCGCTCGGTC GGCCCCCGAA AGACGTTGAG AAGAACCGGG CCCGCCGGCG CCAGATTAGG GAAGACGAAG GCATCAGGAA TGCAGTTGAA GGCAAGTTCG GCCAAGCGAA GCGCCGATAC GGACTGGGCC GGGTGATGGC CAGGCTGGCT GAAAGCAGCC TGAGCGCGGT CTCAATCACA TTCCTCGTGA TGAATCTGGA CCGGGTGCTC GCCGCTCCTT TTTTGTGCCT GTTTGAATGG CTCTTTTTGG AGCTTGACGT AATCAGAAAT TTGTTCGCGT GGTCATCTCC CCGAGCGGCG ATTGGGTGCC GCGGGGCGAT GGCTTGA
|
Protein sequence | MYQQKFQQLT FENFHLPFGG KLDPENRWVR LADVIPWHVA ETMYAKNFMS KRGAPALTVR MALGSLIIKE KLGLSDIETV EQIKENPYLQ YFIGLESFQH IAPFDASMLT HFRKRLKHTD LGALQEELLQ RHLAEERKRA EEKKENQNGD DDGNEGPANK GKLIVDATCA PADIAYPTDI GLLNDAREKT ERIIDELYAM HPEGVSKPRT YRKRARKDFL ALGMKKKLSK KALRKGLGKQ LRYLRRNLEH IAMLSGSVPL TVLSARWHRD LLVIGELYRQ QVEMWQSGKK NISDRIVSIS QPHVRPIVRG KAAAKTEFGM KLSISVVDGI SMPERMSWNA YNEGCDLVRD IERYRERYGH YPESVHADKI YRTLANRMWC KARGIRLSGV PLGRPPKDVE KNRARRRQIR EDEGIRNAVE GKFGQAKRRY GLGRVMARLA ESSLSAVSIT FLVMNLDRVL AAPFLCLFEW LFLELDVIRN LFAWSSPRAA IGCRGAMA
|
| |
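As a rough check that the gene and protein sequences above correspond, the listed gene sequence can be translated codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene is on the minus strand), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. The helper names `translate` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each cycling T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the 10-base display blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence in the record.
print(translate("ATGTACCAAC AGAAGTTTCA GCAGCTCACG"))  # MYQQKFQQLT
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein and the GC content field.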