Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1699 |
Symbol | |
ID | 3746388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2206529 |
End bp | 2208310 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637774236 |
Product | hypothetical protein |
Protein accession | YP_379993 |
Protein GI | 78189655 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00360282 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGCT TCTGCCAACA ACTTTTTTCA TGCTTTAGCT TAACTACCCG ACGCACCATG AAACCAACAT TTCATTTCAC CCTTATTTGT TGTGCTGTTG CACTTTTGGC AAGCAATCAA CAAACCTTTG CAGGTAATGA CGTGCTGCAT CAAAATAGTG CCTCGCTCCA AAGTTCACGC TTGCAAAATG AGCCACAACG TGCGCCAACG CCATCGGTGA TTGCCTTTCA ACCCAATCAA AAAAGTGTTA CAGGAACCAT TGGCAACGAT CGCATTACGC TCTATGGCAA TAGTGGTTTG GCAAGTACCG CAACACCCGT GCGCGATCCC GCAATTTCGC GCTCGCAAAC CTTTAGCTCT ACCGTGGCGA TTGATTTAAA TGGTACCATT TTGCCCGGTA CAGCGGCTGA AACCCCTACG GTACATTTTT ACATTAATGG CAAAGATTAT GGCTTAGCTA CGCTCAGTAG TGTACAAAGT GATTACAGTA AAAAAATTGG CGGCATTGCC CACAGTGGCA AGCAACGCTT TATTTTTCCT GTAGATGATA TTGATATTCG CACCATCAAA ATTGAAATTG AGTCGCCTGC CGTACTGCGT TCCGAAGTCT ATATTTATGG CGTTACCATT ACGCCCGAAG GCGCGGCTGA CCAAAAAGTT GAACCCAGCT CGTTGCGTGG TGCCACGGTA ACCTTTGCCA CGCCATCGCG CTATTATGAT GGTGGAAAAA GCTATAAAAT CCCTTACGGC TCAATTCCAA GCGATGTTCG ATCTATTACT ATTGATACCT CACTCTATCG CAAAACATTG CAGCAAGCGC CTGGCACTCC AGCAAATCCA TTAACAGTGC ATGGCGGTGG CGGCATTGAT ACACTTTACT TGCTTGGTAA CCAAGATCAG TACGTGTTGG CAGGTGGTAA AAATAGTTCG CTTATTATTG CTGAATCGGC GGGGTTAAGC CAAAATGCGC TCGCCACTAA CATTGCAAAA GTTGAATTTG CTGATAGCTC TTTCTTTTTA CCTCCGCAAG CTACCGGCAT CAATGAAGCG GTATTGGCAG AAAACGAGGC ATCAATAGCT ACACTTCGCG CCACCTTGCC ACCCCATGTG GGCATGGCAG TTCATCCCAT AAAGCATGAC CCGCTACGTG GCAAAATAGG CGATGTATTA AGCAAAAATT GCCCCACCGA GTTTTACCAA AATGCTGTGC GTTTAACGGC ACCGCAAGGT GATGCTCCCA TGGCTCTACG CTCTTTGGGA TTTGTTATTG CACCAGCACG CCGTGATGCC CCTACCATTA AGTTGCAAGG TGCTGCAAAA GAGCAAGAAA TGGTGGTAGT TGATACAGCC ACTTTGCCAG AAACTTCATT GATTGAGGCA CAGCGCGTTA ATGTTGTTTT GTTGAGCGGC AAAACGCCGA TAACGTTTCG CGGAAATGGT GATGGCATGG TAATTCTTGC GGACGAAGGC AATCAAGAGA TGTGGGGCGG CACGGGCGAT GACCTATTAT GGGGAGGTGT GGGAAATGAT AATTTGTATG GGGGAATTGA TGATGACCTG CTTTGCGGCG GCAGTGGCGA TGATGTGTTA GATGGAGGTT CAGGCATTGA TGCCGCCTAC TTTAGCGGCA AAAGCGAAGA GTACCGCATT ACGCACCATC CCACAACCAG CATGACCACC GTTGTGGATT TGGTTGCTGA ACGCGATGGT ACCGATAACC TCTTTAATAT TGAGCAATTA CGTTTTGCCG ATAGAACTAT GTTTTTAGGC GATAAAAAGT AG
|
Protein sequence | MDSFCQQLFS CFSLTTRRTM KPTFHFTLIC CAVALLASNQ QTFAGNDVLH QNSASLQSSR LQNEPQRAPT PSVIAFQPNQ KSVTGTIGND RITLYGNSGL ASTATPVRDP AISRSQTFSS TVAIDLNGTI LPGTAAETPT VHFYINGKDY GLATLSSVQS DYSKKIGGIA HSGKQRFIFP VDDIDIRTIK IEIESPAVLR SEVYIYGVTI TPEGAADQKV EPSSLRGATV TFATPSRYYD GGKSYKIPYG SIPSDVRSIT IDTSLYRKTL QQAPGTPANP LTVHGGGGID TLYLLGNQDQ YVLAGGKNSS LIIAESAGLS QNALATNIAK VEFADSSFFL PPQATGINEA VLAENEASIA TLRATLPPHV GMAVHPIKHD PLRGKIGDVL SKNCPTEFYQ NAVRLTAPQG DAPMALRSLG FVIAPARRDA PTIKLQGAAK EQEMVVVDTA TLPETSLIEA QRVNVVLLSG KTPITFRGNG DGMVILADEG NQEMWGGTGD DLLWGGVGND NLYGGIDDDL LCGGSGDDVL DGGSGIDAAY FSGKSEEYRI THHPTTSMTT VVDLVAERDG TDNLFNIEQL RFADRTMFLG DKK
|
| |