Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1107 |
Symbol | |
ID | 3748325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1493593 |
End bp | 1495860 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637773638 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_379412 |
Protein GI | 78189074 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAGC CATCAAAGCC CCATAGTGCC GTAAAACCAA AGCGAGCAAT AGGTTTATCG TTGGCACCTT ATCCCGCTGT GCGGTTGCTT TTTTTTGTCA TTATAGGAAT TGTTGTAGGG GTTGTTGCAC CTTTTTCGCT CACGGAGTGG CTATGGAGCG TAGCGTTATC ATTTGCGCTG TTGTTGCTTA CATGGCTTTA TGAGCGAATT CGTTATCACC AAGCGGCTGT ACCGCATTTT GGCATGGCGA TAATGTATTG TTTCGTTGTG GTTTCGGTGT TTGCAACGCT CAGTGCTTAC CGATTGCACT ATGCACCACG TAACGGTTTA ACGCAATATG CTGGGCGCAC CGTGATTCTT TATGGCTCCA TTGAAAGCCG TCCCGAACGT TCTAAAGGTG GTGCAAGTTG GGTTATGGAG GTGCAAGAGC TTTTTGAACA TGGCAAAACC GTTACGCTTC GCGACCGCAC GAAAGTATTT ATGCGCATGA GTGCCGATGC TCATCTTGCT GTTCAAAAAG GCGATATGGT TCGCGTAAAA GGCAAGCTTG ATTTGTTGCC CGAAGCTGCT AATGCAGGGG AGTTTAATCC TCGCCATTAC GGGGCTATGC AACAAATCTC TGTGCAGCTT TATGCTGCCG GTCCATGGCA GGTGCTCTAT GAGGGTGAAA AGCGCTTACA TCCATTCGAG CAATATATGG TGCAACCAAC CTATCGCTAC ATTATGCAGG CGCTTGCCGC ACTTCTGCCC GATGGCGAGG AGCGTAAGTT AGCCGCAGGC GTGCTTACGG GTGAGCGCGA AACCATGTCG GAAGAGGTGT TTGAAGCTTT TAAGCGCACA GGTACAGCAC ATATTTTAGC CGTCTCAGGC ATGAATGTTG GCTTGTTGGC ATTAATTATT CAAGTATTTT TGCAACGGCT TAAAATTACG CCATTTGGGC GTTGGACGGC GTTTTTGCTC TTTGTTTTTT TGTTGATACT CTATAGCAAT GTTACGGGGA ATTCAGCCTC AGTAACGCGT GCGGCATTTA TGGCGTTAGT ATTGATTGCT GGAGAAACTG TTGGGCAAAA AACATATCCG CTGAATTCCT TAGCCGTTGC TGACTTGATT ATTTTGCTTA TTAATCCGCT TGATCTCCTG AACCCAGGTT TTTTAATGAC CAATGGCGCC GTTTTAGCGC TTTTTCTTGT TTATCCGCTT CTCCATTTTC CACGCCCTAA AAATCGAACC CTTCTTTTAT CAATAGTGTG GTTTCTGCTT GATAGCATCA TTATTACCCT TGCCGCAAGC ATTGGCGTTT CACCCGTAAT TGCCTACTAC TTTGGCACCT TTTCGCTTAT TAGCTTTGTT GCAAACATTC CTGTTGTCTT TTTTTCTACC TTGCTGATGT ATGCCTTAGT GCCAATGTTG GTTGTGTATG GGTTGTCGCA AGCTCTTGCC TCCGTTTTTG CCGCAGGCGC TTTTTGGCTT GCCCGCATGA CATTGCAATC GGCATTGTGG TTTAGCAATT TCTCTTTTGC TTCAATTCCC TTAAAACTTG ATGCGGTAGA GGTGTGGCTT TATTACATAG TGCTTGCAGC GGTGCTCTTG CTGGCAACAC GTAAAGCGTG GAGCCGTGTT GCCATAACGT TTTTGTTGGG AGTAAATCTT TTTGTGTGGT ACTCCTTGCT TTTTCGTCCA AATCCAATAG CACCAACACT GCTTACTGTT AACTTAGGTC GGAATCTTGC AACAATTGTT TCCAATGGCA GCGAAAGTGT GCTTATTGAT GTTGGCAAGA AACCCAAAGA TTATCAACGC ATTAGCGCCC AATTTGAGCG TTTTGGCATT GTAGAGCCCA CTGCCGTTGT ACAATTTTAT TCACCCGACT CGTTGATTTT GGCAACGCCA ACTCGCCACC ATTTTTTACG GAGCGATAGT CTGCTGCGGC TTTCTTCCAT GGTTATTACG CGACCCGACG AAAAAATGGT AAAGCTTTGG AGCCGCAACC AAAGCTATTT TCTTGCTTCA GGCACCAGCC GTTTAAAAGC GGGGGAGCCC TATTGTGGCG ATGTGGCTTG TATTTGGATT TATCGCTTTG GCGAAAAGCA GCGCATTGAG TTGGAACGGT GGCTTACCGC AACAAAGCCA AAAGAGGCTC TTTTGGTGCC AAGCTCATTT TTATCGCGCG TGCAGCTTGT GGCATTGCAC CGTTTTGCAG CGGCTTATCC TCATGTTGAG GTGCGTAGTA AAACTAAGCA GGTGGTGGTG AATGGGGGAG AGAGGTAA
|
Protein sequence | MTEPSKPHSA VKPKRAIGLS LAPYPAVRLL FFVIIGIVVG VVAPFSLTEW LWSVALSFAL LLLTWLYERI RYHQAAVPHF GMAIMYCFVV VSVFATLSAY RLHYAPRNGL TQYAGRTVIL YGSIESRPER SKGGASWVME VQELFEHGKT VTLRDRTKVF MRMSADAHLA VQKGDMVRVK GKLDLLPEAA NAGEFNPRHY GAMQQISVQL YAAGPWQVLY EGEKRLHPFE QYMVQPTYRY IMQALAALLP DGEERKLAAG VLTGERETMS EEVFEAFKRT GTAHILAVSG MNVGLLALII QVFLQRLKIT PFGRWTAFLL FVFLLILYSN VTGNSASVTR AAFMALVLIA GETVGQKTYP LNSLAVADLI ILLINPLDLL NPGFLMTNGA VLALFLVYPL LHFPRPKNRT LLLSIVWFLL DSIIITLAAS IGVSPVIAYY FGTFSLISFV ANIPVVFFST LLMYALVPML VVYGLSQALA SVFAAGAFWL ARMTLQSALW FSNFSFASIP LKLDAVEVWL YYIVLAAVLL LATRKAWSRV AITFLLGVNL FVWYSLLFRP NPIAPTLLTV NLGRNLATIV SNGSESVLID VGKKPKDYQR ISAQFERFGI VEPTAVVQFY SPDSLILATP TRHHFLRSDS LLRLSSMVIT RPDEKMVKLW SRNQSYFLAS GTSRLKAGEP YCGDVACIWI YRFGEKQRIE LERWLTATKP KEALLVPSSF LSRVQLVALH RFAAAYPHVE VRSKTKQVVV NGGER
|
| |