Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1722 |
Symbol | |
ID | 3746505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2236557 |
End bp | 2239598 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637774259 |
Product | hypothetical protein |
Protein accession | YP_380016 |
Protein GI | 78189678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.919333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTG ATGAACTATA TCAGGTATTT CTTCCAACCT TTACCATGGT AGAAGAAGAA TGGAATAATT TTTTAGCTGA AAAGAATACT GCGCTCTCAG AAGCTCAAAA CTATCTTAAT GCTATTTTTT CTATTACAGG CTACAATTTA CCTACCTACG AAGAAATATC ATCAACGCGC TATCTTGGAG TCTATGAAAA TGGTGCGCAA GTAACCTTTG AGGGTAGTGG TATTGATAAG CTCTTTGGTG GCGATATGAC TACCGGTACT CTCTCACTAT CTCGCATTGC TCTTGATTCG TATGCCAATA ACATCCACAT GGCAATGCTT GGCTACAACA ATGGCATTAC GGTTGATTTA AACAGTGGCG CTGTTAGTGG CATGTTTAAT GAATATTCGC TTACTACGCC ATGGTTTGAG CTATCTGCGG TTGGTACGGT TGCGGTTTCA GGTGCTTCAA TTCTTTCGGA AATGAATATT GAGGCTGAAC TGACGGAGCT TTCTTTTACT TATCCTGATG ATGGTGTTAT TGTAAAGCTG CTTGGTGATA TTGATTATTA TCAAGATGAG CTTGGTAATA TGGAGTATAG CGGAGATGTT TATACTGCGT ACTTTACGGG ATGGGGTGCC GATATTACTC TGAATGGAGA TTTTCAGTGT GATTTTGATA TTAATAATAC TTTTATATTA TTTAGTGATG AACTAACAGG GGAGCTTTAT GTATCAGAAC TTTCTCTTGT AATTCCATCT CAACATGTTG TTGCTAATTT TTATAATTCT TTAACAGGAT TATCGTATGA TATTACCTCT GGTTCTCTTG GTGGGAGTTT TAATTCATTC CATTTTGGAA CTTCGTTATT TGATGTTTAT GCACATGGCT CAATTTATGT AGAGCCTGTT AGCTCATCAT CTACTTTAGG TATTCATGGA ATATTGGATA GCATTGAAAT AACCTATCCT GAGAGTACGT TTGCAATAAC TGTTGTTGGG GATGTAGAGT ATATCCAAAT TGATCAAGGT GAATATACCT ATTTTGGAAC TATGTCGGAA GTATATCTGG AAAATCCTAC AACCACAGTT TCTGTACTTG GTGATTTTTC AGGTTCATAT AATGACTCTA ATGGATTACA TCTTGCAGGA AATTTGTATG AATTCCATTG GCAGCGGGAA GAGGCATTTA TTTCATTTGT TGGTGATATA GTATTTGGTG AAGATCAGCT TGTTGTTAAT GAGGTTACAA CTCTTGAGGT GTATGGGGAT GGACGATATT ACGATGCTTC ATCGCTTTCT GTAATGAATA TAGTTACTGA TGTTGTAGGT AATGAGTTGC TTGCTCTTGG TGAAGATGCT AATTGGGATA TAAGTGCAGC GTTGGATGAA TTGTTGTGGG ACGTTATCAA TAAATTGAAT GGTGATGCGG AAGGTGTTAG TAGTGTAAAT TTTGATAGTG TTCCCGCAAG TAGCACACCT GTAAATGCTG AATTTCTTGA TTTCTATCTT GATTTATCAA AAGTAGGTGA AGCGGGGTAT TATGCTTCGT TCCGTGTAGG TCATTTGTAC GATACTAACG GCGATGGTTT ACCAGATTAC GTAGATGAAA TTCATGATAG CCCAGCTACC ATTACATGGA ATAATGGAAT GTTTACAGTT CTTAGTCTTG ATGATAGTAG CACAAGGGCT ACCGGTTCTT TGGCTTATGA TGGTAATGGA AACGCCGTTG GATTGTATGC CTTTGATCGT GCTTCTGGTG ATAGCGAAAC CACTCCTCCA ACCCTCATCG CTGCTACCCC ATCGGATAAT GCTATGGGCA TAGAGGTTGA AAGTGATCTA TCATTTATCT TTAGCGAAAA TGTACAATTT GGTAACGGTA CTATTGAAAT TCACAGAGGT TCTGCCACGG GTGAATTTGT CGAGAGTTAC AATATCGGCA CTCCACTCAG CACGAATCTC AACATTGTTG GTTCTACGCT TACTATTAAT CCAACAAGCG ATTTAGCAAG CAATACCCAC TACTTTGTAA CCTTTAGTGA AGGAAGTATT CGAGATTTAG ATGGCAATAA TTACGTTGCA TCGCAACCTT ACGACTTTAC TACAGGCGCT GATCCTTATC CAACGCACAC CCTCACAGGC AACATCACCT TCTGGAAAAC CGGCGAAGCC ATTACCGACG TAACAACAAC CCTTACCACC TTGCCAACCA ATGGCACACA CGCCATTGAA CTTAAAAACA TTCACGTTCA AGCCAATGGC AGCCACACCA TTGAAGTATG GGCAACCACG CCAAACAGCA CCACTGGTAG CTTTGAATGC GAATTTGCCT TGCCTACAGG CACAAGCGTA ACGTGGCAGG ATGCCGCTAA ATTGCCTTCA GGCTGGATGA CAACCAACAA CGTCATCGCC ACCGGTGCGT TCCGTGTTGC GAGTATTGGC ACCCATGCTT TAGCTGAAGG TGCAGTACAA CTTGGCACGC TCACCATTAG CCAATCCGCA AACCCCGGCA CCTTTGAACT CGCTATGACG CACGCCCAAC TTGGCAACAA CGATGTTGCC GGCTATGCAA TCAGCAGCGT TAGCTCCACC ACCGGCAGCG GCAATGAATA TCAGTACCAT TCGCTTACCG ATGGGCACTA TGCGCTCACG GGCGACAAAG CCGCAGGAGA TGCAGGCAGC GCCGTACACG CTAACGACGC ACTTGCCGCC TTAAAAATGG CAGTGGAACT TAATCCCAAC GAAGCCAACG CAAATGGGTT GCTCGGTCCC GTTTCACCAT TCCAATACCT TGCTGCCGAC ATCAACCGCG ACGGCAAAGT GCGTGCAAAC GATGCCCTCA ATATTTTGAA AATGGCAGTT GGCATTGAAT CAGCACCAAC CGACGAATGG ATTTTTGTTG CCGAATCCGT TACCGGCAAA ACCATGGATC GCAGCCATGT TGACTGGTCA GACATCAGTC CTATTGTGGA CTTCAACCAA ACCGCCATTG AACTCGACCT CATCGGTATT GTCAAAGGCG ATGTTGATGG CAGTTGGGTA ATGGTGGGAT AA
|
Protein sequence | MKLDELYQVF LPTFTMVEEE WNNFLAEKNT ALSEAQNYLN AIFSITGYNL PTYEEISSTR YLGVYENGAQ VTFEGSGIDK LFGGDMTTGT LSLSRIALDS YANNIHMAML GYNNGITVDL NSGAVSGMFN EYSLTTPWFE LSAVGTVAVS GASILSEMNI EAELTELSFT YPDDGVIVKL LGDIDYYQDE LGNMEYSGDV YTAYFTGWGA DITLNGDFQC DFDINNTFIL FSDELTGELY VSELSLVIPS QHVVANFYNS LTGLSYDITS GSLGGSFNSF HFGTSLFDVY AHGSIYVEPV SSSSTLGIHG ILDSIEITYP ESTFAITVVG DVEYIQIDQG EYTYFGTMSE VYLENPTTTV SVLGDFSGSY NDSNGLHLAG NLYEFHWQRE EAFISFVGDI VFGEDQLVVN EVTTLEVYGD GRYYDASSLS VMNIVTDVVG NELLALGEDA NWDISAALDE LLWDVINKLN GDAEGVSSVN FDSVPASSTP VNAEFLDFYL DLSKVGEAGY YASFRVGHLY DTNGDGLPDY VDEIHDSPAT ITWNNGMFTV LSLDDSSTRA TGSLAYDGNG NAVGLYAFDR ASGDSETTPP TLIAATPSDN AMGIEVESDL SFIFSENVQF GNGTIEIHRG SATGEFVESY NIGTPLSTNL NIVGSTLTIN PTSDLASNTH YFVTFSEGSI RDLDGNNYVA SQPYDFTTGA DPYPTHTLTG NITFWKTGEA ITDVTTTLTT LPTNGTHAIE LKNIHVQANG SHTIEVWATT PNSTTGSFEC EFALPTGTSV TWQDAAKLPS GWMTTNNVIA TGAFRVASIG THALAEGAVQ LGTLTISQSA NPGTFELAMT HAQLGNNDVA GYAISSVSST TGSGNEYQYH SLTDGHYALT GDKAAGDAGS AVHANDALAA LKMAVELNPN EANANGLLGP VSPFQYLAAD INRDGKVRAN DALNILKMAV GIESAPTDEW IFVAESVTGK TMDRSHVDWS DISPIVDFNQ TAIELDLIGI VKGDVDGSWV MVG
|
| |