Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1047 |
Symbol | |
ID | 3747775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1416029 |
End bp | 1417507 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637773576 |
Product | hypothetical protein |
Protein accession | YP_379352 |
Protein GI | 78189014 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000670981 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTGCC CTTACTACAA CCGACCGCCA ACATCATCGT TACCATTCCC CACCGCCGTA GAGACAACGC ATGCGTTGTC TCTACAATCC ATAAAAAAAT TTCCTACCTT AACACTATAC AACCTCCACG CCTTTTTCCA CGTTAATTTA TTAACGATTA TGGCTACCAA CTCTTCCTCC CCGCTTGTGC GTCGTGAATT GTTCATTTTC GATGCTTCTG TTTCTAACCT TTCAACGCTT TCTTCTGCAT TATCTGCTAA CAGCAGCTAT TTTGTGCTTG ATTCCACGCG CGATGGTTTG GTGCAAATTG CTGATTTGCT TGCGGGGCAA ACGGATATTG ATTCCCTCCA TATCTTTAGC CACGGCAGTG CTGGTTCGTT GCAGCTTGGC AACTCCTCGC TTTCGCTTGT AAATCTGAAT AACTACGAAT TACCGTTGTC GGTGATTGGT TCATCGTTAT CATCAAGTGG TGATATTTTG CTGTACGGTT GCAATGTTGG TGCCGGTGAT GAAGGACTTG CGTTTGTTGA TAAACTTGCG AAAATGACGG GCGCTGATGT TGCGGCTTCG GATGACTTGA CGGGTGCAAC GGCGCTTGGT GGGGATTGTG AGTTGGAGGT GGAGAGTGGG GTTATTGATG AGGCTTCATT TTATTATGCC CCTGAATATG CTGGACTTCT TGGAGCTGTG GGACCAGAGT TTCATGTTGA CACCTCTGAT ATTCAAGTTT GGTCATATGA ACCATCCGTG GCTGCTTTAG CTAATGGTGG TTTTGTGGTA ACATGGATTT CGGAAACCTT GGAGACACTG TCCTCAGACA CCCATACCGA TATACATGGA CAGCTATACA ACAGTGAGGG TGCAATGGTT GGATCGGAAT TTCAGGTTAA CACTTACACT CAATACGGTC AATACACTCC TTCTATAACC GCTTTAGCTG ATGGTGGTTT TGTGGTAACA TGGATTTCGG AAACCTTGGA GACACTGTCC TCAGACACCC ATACCGATAT ACATGGACAG CTATACAACA GTGAGGGTGC AATGGTTGGA TCGGAATTTC AGGTTAACAC TTACACTCAA TACGGTCAAT ACACTCCTTC TATAACCGCT TTAGCTGATG GTGGTTTCGT AATTATATGG AGATGCGTAA ATAATGACGA CTATAACTGT AACTATATAC ATGGCCAGCG CTATAATGCT GATGGGATAA TGGTTGGTTC AGAGTTTCAG GTAAACACCT ATACTCAAAT TGGGGCATAT GAACCATCCG TGGCTGCTTT AGCTGATGGC GGTTTCGTGG TAACATGGGA ATCAGGAATA GTAACAACAT GGAAGTCAGG ATATCAGGAT ACTTCCAATT CAGATATTTA CGGTCAAATA TTCAATGTTG ATGGAGCAAT GGTTGGCTCG GAATTTCGGA TCAACACCTA TACAAAAGGT TTTCAAGGCT GTCCTTCTGT GACCTCCCTT ACTAATTAA
|
Protein sequence | MNCPYYNRPP TSSLPFPTAV ETTHALSLQS IKKFPTLTLY NLHAFFHVNL LTIMATNSSS PLVRRELFIF DASVSNLSTL SSALSANSSY FVLDSTRDGL VQIADLLAGQ TDIDSLHIFS HGSAGSLQLG NSSLSLVNLN NYELPLSVIG SSLSSSGDIL LYGCNVGAGD EGLAFVDKLA KMTGADVAAS DDLTGATALG GDCELEVESG VIDEASFYYA PEYAGLLGAV GPEFHVDTSD IQVWSYEPSV AALANGGFVV TWISETLETL SSDTHTDIHG QLYNSEGAMV GSEFQVNTYT QYGQYTPSIT ALADGGFVVT WISETLETLS SDTHTDIHGQ LYNSEGAMVG SEFQVNTYTQ YGQYTPSITA LADGGFVIIW RCVNNDDYNC NYIHGQRYNA DGIMVGSEFQ VNTYTQIGAY EPSVAALADG GFVVTWESGI VTTWKSGYQD TSNSDIYGQI FNVDGAMVGS EFRINTYTKG FQGCPSVTSL TN
|
| |