Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1722 |
Symbol | |
ID | 4571082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1948838 |
End bp | 1951039 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639766304 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_912163 |
Protein GI | 119357519 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCAA TTTTCGCTCC ATATCCTGCA CTCAGGCTTC TGTTACAGGT TATAGCAGGT ATTCTTTGCG GAACGCTGCT TCTTGTGCCG CTTGAGGCGT GGCTTGCTTG CTCACTTTTC TTTTTTATTC TGCTTTCCTG CGCTCTTTTT TACGAGTTTA TTGGACGTAA AGGTTCGTTT CCACTATCGT TGACGGCGAT CGGGTATTTT TTTTTCGTGG TTGTTGCTTT TGCGGCCTAT AGCGACTATC TGGTACACTA TCAGTCCAAA GCCGGACTTT TACGTTACAG CGGAAAAGAG GTGATTCTCT ACGGGCGGGT TGCCGACAGG CCGGAACGTA CCGAAAAGGG AGTCAAGTGG ACGATGGAGG CAGAAGAGCT GTTTTTTGAA GGCCGGACAA TCACACTTCA TGAGCGGCTC AAGGTGTTTA TGAGGGCACC GGCGGGCGAA AAGATCCGAA TGGCTTATGG CGACAGGATT CGCGTGAAGG GACAGATTGC CGCGATTCCC GGTGCTGCAA ACCGTGGAGA ATTCGATCCC CGATACTATG CGATGCTTCA GCAGGTCAGG GTGCAGCTAT ACTGTCACGG TCCATGGATG GTGTTGCATG ATGGCCGGAA CACTCTGAGT GTGTTTGAAC GCTTTGTGGT GCAGCCGGTT TATGCCTATG TTGTCCGGAG CGTTGAAACG CTGATTCCGC CGGGCGAAGA GCGCAAGCTT GTCAGGGGCG TGCTGCTCGG AGAAAAGGAG GTGCTGACCG AAGAGGTTTT CGACGCCTTC AAACGAACCG GCACAGCGCA TGTGCTTGCT GTTTCGGGCT TCAACGTCGC GCTTCTCGCA CTCGGCATTC ACGTCTGTCT GCAGCGCGTC AAGGTGACAG CTCAGGGCCG ATGGATCTCT TTTCTTCTGA TTGCCTTTAT TCTGCTCGTT TTCAGCTATG TCACCGGCAA TTCCGCATCG GTAAAACGTG CGGCCATCAT GTCGGTCGTG CTCCTCGGTG GCGAGACGCT CGGGAGAAAA GCCTTTGCGG TCAACTCACT TGCCGCATCC GATGTGATTA TTCTGTTTAT TGACCCGCTT GACCTGTTCA ATCCCGGTTT TCTGATGACC AACGCCGCGG TTCTTGCCAT TCTGCTGGTC TATCCGCTTT TTTACGGATC GGATAAGGAG GAAGAGGGCG TATGGCGCAG GATATGGAGA TTTCTGCTCA GCAGTTTTTT TGTCAGTCTT GCCGCCATCA TCGGGGTTTC TCCCGTTATA GCCTATTACT TCGGTACCTT TTCACTTGTC AGTCTTGTCG CCAATCTGCC GGTCGTTCTC TTTTCAACGC TCCTGATGTA TGCCCTGATG CCGATGCTGC TTGTAAACCT CGTTTCAGGG TATCTCGCCT CATATTTTGC CATGAGCGGC TATCTTTTTG CCTGGTTGAC TCTCCAGTCT GCTCTTTTTT TCAGCAAGCT CCCGTTTGCC TCGATAGAAC TTAAACCCGA TATGATCGAA GTGGCGATCT ACTATACTGT ACTGACGGTT GCAGCATGGT ATCTGTACAA AAAAACATGG GGCAGGCTTG CCATTACCCT GCTTCTTGGT CTCAATCTGT TTTTCTGGTA CTCTTTTCTC TCCGCCGGGG AACAGGGCAG GGGAGTGGTG ACGGTGAATC TTGGAAAAAA TCTTGCCGTT CTCTTTTCGA CGCATGGCGA GACGGTGGTG ATCGATACCG GAACCCGATC ACGCGACGTT GAACGAGTGA TGCGCCAGGT CAGTGAGTAC AGGTTGGCCG AGCCTGTGGC AGCGGTACAG TTTTTTTCGC CGGATTCACT TGTCGGGATG GTGCCGGTAA AGCGCCATCT TCGGCAGCAG GAGAACTCCC TTGCGCTTGC CTCCATGGTG ATCGTGCGTC CGGAGGAGAA GGTGCTGAAA CTCTGGAGCA GAGAGCGCTC GCTGCTTCTG ATCTCCGGCA CCAGCCGATT GAAAGAGGAG GAACTCTACA AAGCCGATGT GGTGCTGCTC TGGGTTTACC GGTTTGCCGA AAAACAGCGG CGGGAAATTT CTTCATGGCT GAACTATGCT CGTCCAATAA AGTGCATCAT CGTTCCAGGC TCGTTTCTCA CCCTTCGACA GAAAGAGGAT CTCATCCGTT TTGCCGCAGC TCATCCCGAA CTCGAAATCC GTTCCAAAGC CCGACAGATC GTGATTCGCT GA
|
Protein sequence | MSAIFAPYPA LRLLLQVIAG ILCGTLLLVP LEAWLACSLF FFILLSCALF YEFIGRKGSF PLSLTAIGYF FFVVVAFAAY SDYLVHYQSK AGLLRYSGKE VILYGRVADR PERTEKGVKW TMEAEELFFE GRTITLHERL KVFMRAPAGE KIRMAYGDRI RVKGQIAAIP GAANRGEFDP RYYAMLQQVR VQLYCHGPWM VLHDGRNTLS VFERFVVQPV YAYVVRSVET LIPPGEERKL VRGVLLGEKE VLTEEVFDAF KRTGTAHVLA VSGFNVALLA LGIHVCLQRV KVTAQGRWIS FLLIAFILLV FSYVTGNSAS VKRAAIMSVV LLGGETLGRK AFAVNSLAAS DVIILFIDPL DLFNPGFLMT NAAVLAILLV YPLFYGSDKE EEGVWRRIWR FLLSSFFVSL AAIIGVSPVI AYYFGTFSLV SLVANLPVVL FSTLLMYALM PMLLVNLVSG YLASYFAMSG YLFAWLTLQS ALFFSKLPFA SIELKPDMIE VAIYYTVLTV AAWYLYKKTW GRLAITLLLG LNLFFWYSFL SAGEQGRGVV TVNLGKNLAV LFSTHGETVV IDTGTRSRDV ERVMRQVSEY RLAEPVAAVQ FFSPDSLVGM VPVKRHLRQQ ENSLALASMV IVRPEEKVLK LWSRERSLLL ISGTSRLKEE ELYKADVVLL WVYRFAEKQR REISSWLNYA RPIKCIIVPG SFLTLRQKED LIRFAAAHPE LEIRSKARQI VIR
|
| |