Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1447 |
Symbol | |
ID | 3746645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1914605 |
End bp | 1915762 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637773981 |
Product | SMF protein |
Protein accession | YP_379746 |
Protein GI | 78189408 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATTC TTAACTTTTT AATGCTATCG CAAGTGCCGG GCATTGGCGC AGCACGCATT AAAGCCCTTC TAACCCATTG GGGCAATCTC AGCTTTTTGC AGCACGCTAC CATTGCTGAC CTTACGCACA TCAACGGCAT TGGCGAAACG CTCGCCACCG AACTCTACAA CACCTTCCAC AATGCCGCAA AAAACGACAC CGTGCGCCGT GCGGCTGAAG CCCAACTGCT CGCCCTTGAG CGCTGCAACG GGCAAGTACT AACGCTGTTA GACGAAGGCT ATCCACCACT GCTGCGCGAA ATTTACGATC CACCTCCTTG CTTGTTTATT CGCGGCACAC TACCACCCAA CACCGAAAAA AGCCTTGCTG TTGTTGGTAC ACGCCACGCC TCAGCATATG GCAAGCAAGT AACCACTCAC TTTTGCCATG CCATTGCCAA GCAAGAAATG CCCATTATTA GCGGTTTAGC ATACGGTATT GATATGGCAG CCCACCAAGC GGCATTGGAT GCGGGGGGCA CCACCGTTGC CGTGCTTGCC AGTGGGATTG ATACCATTTA CACCGATCCT AAAGGGTTGC TTTGGCCTAA AATTCTTGAG CATGGCGCTA TTGTAAGCGA AGAGTGGATT GGCTCACACA TAACGCCCGC AAAATTCCCT AAGCGCAACC GCATTATTTC GGGCATAGCA AAAGGCACAT TAGTGGTAGA ATCCGACCTC AAAGGCGGTG CGCTTATTAC AGCCACCACG GCGCTTGAGC AAAATCGTGA AGTGTTTGCC GTGCCGGGTT CTATTTTTTC ACACACCTCA CGCGGCACCA ACAAGCTGAT TCAGCAAGGG CAAGCCAAAG CCATTATGGA GGTTGACGAT ATTTTAATGG AGCTGCAACC AAGCCAACCC CACCAAGCCA AACCAATACA CCCAACCAAA GCGACTGCAA ACGCAACCAC CACTACGGCA ACAACACAGC TTCCGCTCCT AAACCCGCTT GAAAGCCAAA TTTATCAAGC ACTAAGCAGT AGCGATCCCA CTCACATTGA CACGCTTGCC GCCACCTTGC AATTAGACCT TTCTACGCTC TTCCTCCATC TTTTTGAGCT TGAATTACAA GGAGTTATTG AGCAACAACC GGGGCAACTC TTTTTACGCA AAGCTTAG
|
Protein sequence | MDILNFLMLS QVPGIGAARI KALLTHWGNL SFLQHATIAD LTHINGIGET LATELYNTFH NAAKNDTVRR AAEAQLLALE RCNGQVLTLL DEGYPPLLRE IYDPPPCLFI RGTLPPNTEK SLAVVGTRHA SAYGKQVTTH FCHAIAKQEM PIISGLAYGI DMAAHQAALD AGGTTVAVLA SGIDTIYTDP KGLLWPKILE HGAIVSEEWI GSHITPAKFP KRNRIISGIA KGTLVVESDL KGGALITATT ALEQNREVFA VPGSIFSHTS RGTNKLIQQG QAKAIMEVDD ILMELQPSQP HQAKPIHPTK ATANATTTTA TTQLPLLNPL ESQIYQALSS SDPTHIDTLA ATLQLDLSTL FLHLFELELQ GVIEQQPGQL FLRKA
|
| |