Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1909 |
Symbol | |
ID | 4570868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2216323 |
End bp | 2217996 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766491 |
Product | hypothetical protein |
Protein accession | YP_912349 |
Protein GI | 119357705 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00461375 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGTTG CTGCGATTGA AAAATTGTCT TCAGCAGCAG GAACCGATTC TCGCGGCGCT ATATTTACAC GCGTTGAAGT AGTTGATTTT ATTCTCGATC TGGCCGGCTA TACGGAGGAT CAATTGCTCC ATGAAAAACG GCTTTTAGAG CCATCATTCG GTGGCGGTGA CTTTCTTTTG CCGGCAATTG ATCGATTACT GGCAGCGTGG AAATCATCGC AACATGTAAC ATCTGTCGTC GAAGAGTTAG GTGATGCAAT TCGCGCTGTT GAGCTTCATC ACAAGACCTT TTCTTCCACC CGTGCAGCCG TGATCGAGAA GTTACGACTG GAATCGATAG ATGTTCAATG CGCAGCAACA CTTGCTGATC GTTGGCTTGT TCATGGGGAT TTTCTGCTTT CTCCACTTGA CGGGCATTTT GACTTTATTG TCGGGAACCC GCCCTATGTG CGCCAGGAGC TTATTCCGGC AGCGTTACTG GCTGAATATC GCAATCGATA TCAGACGATG TATGACCGGG CCGATATTTA TATTCCGTTC ATCGAGCGTT CCCTTTCAGT ATTGGGCAAG GGTGGGAGCC TGGGTTTTAT CTGTGCGGAT CGCTGGATGA AAAACCGCTA TGGCGGTCCG CTTCGCAGCC TTGTGGCCAA TCAGTTTCAC CTGAAAATTT ATGTTGATAT GGTAGATACG CCAGCGTTTC TCTCTGATGT GATAGCCTAT CCAGCCATCA CTATTATCAG CAGGGAAACT CCCGGTGCGA CACGCATTGC TCATCGTCCG GCCATCGACA GAGATACGTT GGCAAACCTG GCTGATGCGC TGTGTGCTTC GAACCAGCTG AATGAGGTCG ATTCGATAAG CGAACTGGCA CAAGTTACTA ATGGTGCAGA ACCCTGGTTA CTCGAATCTT CGGATCAGAT GGCCCTGATC CGTCGTCTGG AAAAACAATT TCCCAGCCTG GAAGAGGTTG GATGCAAGGT GGGTATAGGA GTGGCAACTG GCGCGGACAA AGCATATATC GGAGACTTTG ATACACTCGA CGTGGAGCCT GATCGCAAAC TGCCGCTGGT TACAACCAAA GATATTCTGT CAGGAGAAGT GCAATGGCGC GGTCAAGGTG TCATCAATCC TTTTGTCGAG GGTGGTGGTC TTGTTGAGTT GAGGAGTTAT CCCCGCTTGC GTCGCTACCT GGAGGCTCGA CGAGACGTTA TTATGGACCG ACATTGTGCA CGAAAGTCTC CGGATAACTG GTATCGCACA ATCGACAGGA TCACCCCTGC ACTGGCATCA AGGCCGAAAC TCCTGATTCC CGATATCAAG GGAGAGGCTC ATGTCGTTTA TGAAGGAGGT GCACTTTATC CTCACCATAA TCTTTACTAT GTTACGTCCG ACGAGTGGGA GTTGAGGGCA TTGCAAGCTG TGTTGCTTTC AGACGTCACC CGTCTGTTCA TGTCAACCTA CTCCACCAGA ATGCGTGGAG GCTACCTCCG GTTTCAGGCG CAGTACCTGC GTCGCATCCG CATTCCTCAA TGGGCGAATG TTTCGATAAT GTTGCGGAAG GCGCTTGCGG AAGCTGCAAT CATGAGGGAT TTTCCTGCTT GCAATCGCGC AGTATTCAAG CTATTTGAAC TTGGCAATAA CGAACGATCC GCCCTTGGAG GCAATGGAGA ATAA
|
Protein sequence | MVVAAIEKLS SAAGTDSRGA IFTRVEVVDF ILDLAGYTED QLLHEKRLLE PSFGGGDFLL PAIDRLLAAW KSSQHVTSVV EELGDAIRAV ELHHKTFSST RAAVIEKLRL ESIDVQCAAT LADRWLVHGD FLLSPLDGHF DFIVGNPPYV RQELIPAALL AEYRNRYQTM YDRADIYIPF IERSLSVLGK GGSLGFICAD RWMKNRYGGP LRSLVANQFH LKIYVDMVDT PAFLSDVIAY PAITIISRET PGATRIAHRP AIDRDTLANL ADALCASNQL NEVDSISELA QVTNGAEPWL LESSDQMALI RRLEKQFPSL EEVGCKVGIG VATGADKAYI GDFDTLDVEP DRKLPLVTTK DILSGEVQWR GQGVINPFVE GGGLVELRSY PRLRRYLEAR RDVIMDRHCA RKSPDNWYRT IDRITPALAS RPKLLIPDIK GEAHVVYEGG ALYPHHNLYY VTSDEWELRA LQAVLLSDVT RLFMSTYSTR MRGGYLRFQA QYLRRIRIPQ WANVSIMLRK ALAEAAIMRD FPACNRAVFK LFELGNNERS ALGGNGE
|
| |