Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1414 |
Symbol | |
ID | 6375092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1532651 |
End bp | 1534558 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642683909 |
Product | hypothetical protein |
Protein accession | YP_001959823 |
Protein GI | 189500353 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.751041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.141575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAATA TACGCATTGG ACAGCTGATT ACTCCTTTTG GACCAGGAGC ACTTTATACA GATTCAAAAG GTACCTCGCT GATTATCGGA GGACTCGATC ACTGGTATAA ATCGGATCAT CAGACAGGGG AGATTCATAT AGATGAATTT TCGATTTTCG AGCCAAGGCT CTCGGTATTG TTAGGGGTCG ACAGGTTTCG CAAGCCTGCC GATTTTCGAC TTGATCGCAC AAATCAGAAT GCAAGGATTA CTACCCCGGT TTTAAGGTTT CCGACTTGGT ATCGTGAAGT TTATACAGGC CGGCTGAAGC AGGTCAATCT GGAGTCTATG ATTGTTAATG CGCAAAGAAA TGAACGCTGG GTTCCAGTAA GGTTTATTTC AGCCTGCAAG GCGGGGCATC TTGGTGATTT TCCATGGAAA CAATGGGTCG GCTGTAAATG TCAGAGTAAT GGGAACTTAT ATCTCCATGA TGCTGGAGGG GCAGACCTTG CCAGCGTTTG GGTGGAGTGC CGGACTTGTA ACAGAAGAAA ATCCCTTGCT GGAGTTACAT GGCTGGATGG GGAAAAAGAG GCAGGTAAAC AGAGTGCATT TCAAAGCAGT GGTATTGTTT GCCAGGGTTA TCGTCCCTGG CTTGGAGATT TAAGACGTGG CGAGGGTTGT CAGGAACCTC TTGTAGGCGC ACTGATTAAT CAGAGCAATC TGTATTTTGC CAAAGTAATG TCATCCATTT CACTGCCGGA CCTGTCCATT GAAGATGAGT CAGTCGCTAA CCTGAAGAAT GATATAGAAA AGTGCGAACA CAAAATTGCT ACGGCAAAAG CCCTGTGGAG AATTGGAGAC AAAAATGATG CGATATTAAT AATAGACTCT GCACTGAAAG AAAACGGGAT TGAAGCTGAA CAGAATAAGA TCGAGAGTGT TCTTGAGCGT TTATTTTCCG ATTCCTCTGT TTTGACTGAG CGAAACAATA TGCCTGAGCA ACCTGACGCC CCGGATCTCG CGTTCAGGCG TGATGAATTC AATATCCTCA GAACTAGTGT TAACGATGAC CTTAGGTCAC GGGATCTCAG GGTAATACCG TCGGAAGTCC CTGATAATCT TGTCAGGTGG ATTGGTAAGG TGCATCTTGT AGAACGCCTG CGTGAAACCC GAGCTTTTTA TGGCTTTAGC AGATTGGTGC CGGATGAACA CCCCCTTGAT GATATGCCTG AAAATGCATT GAAGCAACTT TTTCTTCATC CACCGCAACA AAATATTGAG AAATGGCTCC CTGCTATAAC AGTTTTTGGA GAAGGAATCT ATATCGAGCT TGCTGAAGTT GAAATAAATG CCTGGATACA TGCCAACTAT GAATGGTTGG TTCATCGCTT AAACGAGGCT TTCAGGGTAA GGCTTGCAAA TGTTTATCAG GTATATTCTC CGCAAAAAGG ACCATCGATG GAGTGGGCTG CAAGATACTT GCTTGTTCAC AGCCTTTCTC ATATTCTTAT TAATCAGTTG GTATTTGAAG CTGGTTACAG TTCGGCATCC TTGAAAGAGC GTCTGTATGT ATCATCTGAC AGGAGAGCTC CCATGGCAGG GATTCTGATC TATACAGCAG CCGGTGATTC TGAAGGAACA TTAGGTGGGC TGGTCAGAAT CGGGCATAAG GATCGTCTAG GGCCGGTGAT CAGGAAGGCT GTATCCCGTG CTTCATGGTG TTCAGCAGAT CCAGTCTGTT CTGAAAATCT TGGTGGACAG GGAACGCGCC TTGCCAATCT GGCCGCATGC CATGCATGCA TCATGTTGCC TGAAACTGCC TGTGAAACCA TGAATAACGG GCTTGACAGG GCAATTGTAA TTGGAACGCC TGATGAGCGT GAACATGGTT TCATGTCAGA GCTGCTCAGT GAAATTTCTG CGGCTTGA
|
Protein sequence | MENIRIGQLI TPFGPGALYT DSKGTSLIIG GLDHWYKSDH QTGEIHIDEF SIFEPRLSVL LGVDRFRKPA DFRLDRTNQN ARITTPVLRF PTWYREVYTG RLKQVNLESM IVNAQRNERW VPVRFISACK AGHLGDFPWK QWVGCKCQSN GNLYLHDAGG ADLASVWVEC RTCNRRKSLA GVTWLDGEKE AGKQSAFQSS GIVCQGYRPW LGDLRRGEGC QEPLVGALIN QSNLYFAKVM SSISLPDLSI EDESVANLKN DIEKCEHKIA TAKALWRIGD KNDAILIIDS ALKENGIEAE QNKIESVLER LFSDSSVLTE RNNMPEQPDA PDLAFRRDEF NILRTSVNDD LRSRDLRVIP SEVPDNLVRW IGKVHLVERL RETRAFYGFS RLVPDEHPLD DMPENALKQL FLHPPQQNIE KWLPAITVFG EGIYIELAEV EINAWIHANY EWLVHRLNEA FRVRLANVYQ VYSPQKGPSM EWAARYLLVH SLSHILINQL VFEAGYSSAS LKERLYVSSD RRAPMAGILI YTAAGDSEGT LGGLVRIGHK DRLGPVIRKA VSRASWCSAD PVCSENLGGQ GTRLANLAAC HACIMLPETA CETMNNGLDR AIVIGTPDER EHGFMSELLS EISAA
|
| |