Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0712 |
Symbol | |
ID | 6374377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 753005 |
End bp | 754063 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642683223 |
Product | transposase IS4 family protein |
Protein accession | YP_001959149 |
Protein GI | 189499679 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3039] Transposase and inactivated derivatives, IS5 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.147808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACA TCAATCCTCT TGGCCTTTTC GACGAACATT TTCTGCTGGA ACGGCTCACC AAGCTCAAAG ATCCATTGGT GAAACTGGAT ACATATATCG ACTGGAACAT CTTTGCACCG ATCCTGAATG TCGTCTTCAG TAAGCCTGAA AACAGTAGCA AAGCAGGTCG CCCTCCGTTT GATAGAGTCA TGATGTTCAA ACTGCTCATT CTACAAAGCT TGTATAGTCT CTCCGATGAT CAGATGGAGT TCCAAATAAC AGACAGGCTG AGCTTCAAGC GCTTTCTGAA GCTGAAGACC AGCGACAAGG TTCCCGACAG CAAGACCATC TGGAAGTTCC GTGAAACCCT CATCCAGGAA GGGGTTATCG AAGCTCTGTT TCACCGGTTC AATGAGGCCC TTGACGACCA GTCCGTCTTT GCAAATACCG GCCAGATTGT CGATGCCAGT TTTGTTGAAG TGCCCCGTCA GCGCAACACA CGGGACGAGA ACCAGCAGAT CAAGAAAGGC GAAACCCCTG AAGCCTGGAA AGCAAGACCT AACAAACTTC GTCAAAAAGA TCGTGACGCT CGCTGGACGA AGAAAAATAA GATGTCTTTC TATGGCTACA AGAACCATAT AAAAGCCGAC AAGGGAACAA AGCTCATCAG CGACTACATG GTTACCGATG CTTCAGTTCA TGATTCACAG GAGCTTGAAA CCCTTATCAG TACCGACGAT GGCGGTCAGA AACTGTACGC AGACGCGGCC TATATTGGAC AGGAAGAAAC TATCGAAAGC TGTGGTATGA GGAATATGGT TCATGAAAAA GGCAACAGGT ACCATAAACT CACCGATGCC CAGAAGGCTT CGAACAAAGA AAAGTCTCGT ACCCGCGCCA GAGTTGAACA TGTGTTCGGC TTCATGACCA ATTCGATGAA CGCCATGTAC ATCAGAACCA TTGGCTACAT ACGGGCAACA GGCAAGATTG GATTAGCCAA CTTGACCTAT AACATGATGC GCTGCACACA GCTGAAGAAG AAAGTGCACA ATGTTTTCCT GCGGGATAGC TACGCCTAA
|
Protein sequence | MKNINPLGLF DEHFLLERLT KLKDPLVKLD TYIDWNIFAP ILNVVFSKPE NSSKAGRPPF DRVMMFKLLI LQSLYSLSDD QMEFQITDRL SFKRFLKLKT SDKVPDSKTI WKFRETLIQE GVIEALFHRF NEALDDQSVF ANTGQIVDAS FVEVPRQRNT RDENQQIKKG ETPEAWKARP NKLRQKDRDA RWTKKNKMSF YGYKNHIKAD KGTKLISDYM VTDASVHDSQ ELETLISTDD GGQKLYADAA YIGQEETIES CGMRNMVHEK GNRYHKLTDA QKASNKEKSR TRARVEHVFG FMTNSMNAMY IRTIGYIRAT GKIGLANLTY NMMRCTQLKK KVHNVFLRDS YA
|
| |