Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0843 |
Symbol | |
ID | 3746802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1171771 |
End bp | 1175148 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637773372 |
Product | hypothetical protein |
Protein accession | YP_379151 |
Protein GI | 78188813 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGC CGCTTGAAAT GGGATTTGTT GCCGATGACC GCTTGGCTGG TTTTCGACTG CAACGGCTTG AAATATTCAA TTGGGGCACC TTTGATGGCA GGGTTTGGAC GCTCAAATTG GGTGGAAAAA ATGGTTTGCT TACGGGCGAT ATTGGCTCTG GAAAATCAAC CGTTGTTGAT GCTGTAACCA CACTGTTAGT GCCAAGCCAG CGCATTGCTT ACAACAAGGC GGCAGGTGCC GATAACAAAG AACGCACCTT GCGCTCTTAC GTGCTGGGCT ACTATAAATC GGAGCGGCAG GAGAACCTTG GCGGTGGTGC AAAACCCGTT GCTCTGCGCG ACCTCAACAG CTATTCGGTG ATTCTCGGTG TTTTTCATAA CGAAGGTTAC GACAAAACGG TAACGTTGGC GCAACTCTTT TGGATGAAGG ATGCTGCTCA ACCTGCCCGC CTTTATGCTG CGTATGAAGG CGACCTCTCC ATTGCCACCG ATTTTTCTAA CTTTGGTGCT GAAATCGCGA CGCTGCGCAA GCGCCTTCGT GGTGCGGGAG TTGAGCTATT TGAGAGCTTT CCACCCTATG GCGCATGGTT TCGGCGACGC TTCGGCATTG AGAACGAGCA AGCATTAGAG CTGTTTCATC AAACCGTTTC GTTAAAATCT GTAGGAAACT TAACGGACTT TGTGCGCCTC CACATGCTTG AACCCTTTAC GGTTGAGCCA CGTATTGCCG CCCTTATCCA CCATTTTGAA GATCTTAATC GGGCGCACGA AGCGGTGCTC AAGGCAAAGC GGCAAATTGA GATGCTTGCA CCTTTGGTGG CAGATTGCGA CAATCACCAC GCTATGGCGC AAAGAACAGA AGAGCTGCGG GCATCGCGCG ACTGCCTTCG CCCTTGGTTC GCTTCCCTTA AACTTGAATT GCTCGAAAAA CGCCATACTT CGCTTAATGA GGAGCTAAGC CGCCACCAGA TCGCCATTGA GCGTCTGGAC GGGGAGCGCC GCACCTTGCA GGGGCGCGAT CGTGAGCTGC GCCGAACTAT TGCTGAAAAC GGGGGCGATC GAATTGAGAG TATTGCGGCT GAGATTCGGC AGCATCAGCA AGAGCTGGAT CGGCGCACCC AAAAATCAAC CCGATATAAA GAACTTTTGA GGCAGCTTGG GGGGCAGCTT GGCGAGCATC CCGCAACAAG CGCTGAGGAG TTTTATCAGC AGCGAGCAGA ACATGCCGCC ATGCACGAAA GTGCGGCTGA AGCTGAAGTA CAGGTTCAAA ACAATCTGAA CGAAGCAGGT GTGCTCTTTA CCCAAGGGCG TCAGGAGTAT GAGCAACTGA GCACCGAAAT CAAGAGCTTG AAAGCACGCA TGAGCAACAT TGATGAAAAG CAAATTGCCA TGCGCCATGC GCTCTGCAAG GCGTTGAATT TGCCTGAGGT GGAGATGCCT TTTGCGGGCG AATTGCTGCA AGTTCGTGAA GAGGAAACAG CTTGGGAGGG AGCTATTGAA CGGGTGTTGC GCAACTTTGG GTTGTCGCTT TTAGTGCCCG ACCACCATTA CCCAAAAGTG GCGGAATGGG TGGAGCGCAC CAATCTGAAA GGCAAACTTG TCTATTTTCG AGTTCGTCCG CAATCTCGCA ATGAGCAGTT GGCGGATCAT CCCGCTTCAC TTGCCCGCAA GCTTGCCATC AAAGCCGATT CCACCTATTA CGATTGGATT GAACGGGAAG TTTCCCATCG TTTTGATCTG ATTTGCTGCA CCACACAAGA GGAGTTTCGG CGGGAGAAAA AAGCCATCAC GCCAGCGGGA CAAATTAAGT CGCCGGGTGA ACGGCACGAA AAGGATGATC GCCATCGTCT TGACGACCGT AGCCGCTACG TGCTTGGGTG GAGCAATGCC GCTAAAATTG CTGCCCTTGA AGCAAAGGCG AAGGTGCAGC AACGTGAGCT TACCAAACTT GCCGAACGCA TCAGCACGTT GCAACAAGAG CAAAAAGCGC TTAAAGAGCG GCTAACCATT CTCTCCAAGC TTGATGAATA TTCCGATTTC AACGACCTCG ATTGGCAGCC ATTGGCAGTT GCTATTGCGC GATTAGAAAA GGAAAAAGAG GAGCTTGAAA AAACCTCCAA TATTCTGCAA ACCCTTACCG AGCAGCTTGC TTTGCTGGAA CAAGAGCTGC AAAAAACGGA GCAGCAGCTT GACGACCGAA AGGATAAACG CTCTAAAATT GAGCAGAAAA TTAGCAGCAT CACTGAGTTG CAGCAACAAA CGGCGGCACT GCTTGCAGAA GCAGGAGACG AAGTAAGCAA CCGTTTTGCC CTCCTTCAAG CAATGCGCCA AGAGGCTTTT GGCGATCAAT CGCTAACGGT TGAATCGTGT GACAATCGCG AGCGGGAGAT GCGTGAATGG TTGCAGAATA AAATTGACAG CGAAGATAGA AAGCTCTCTC GATTGGGTGA AAAAATCATT CGGGCAATGA CCGAATACAA GGAGGAGTGG AAACTTGAAA CCCGCGACGT GGATGTTAAT ATAGCTGCCG GTAAAGAGTA TCGTGCCATG TTTGAGCAAC TTCAGGCTGA CGATTTGCCT CGCTTTGAGG GGCGCTTTAA AGAGCTGCTG AATGAGAATA CCATCCGCGA AGTTGCCAAT TTTCAGTCGC AACTTGCCCG CGAACGCGAA ACCATCAAAG AGCGTATTGT TCGCCTCAAT GAATCGCTAA CACAAATTGA TTTTAATCCT GGGCGATACA TCACCCTTGA AGCTGAGAAT AGCCTTGATG CCGACATCCG TGATTTTCAA ACGGAGCTTC GTGCTTGCAC CGAAGGAACG CTGACAGGCT CGGATGATGC GCAATATTCG GAAGCCAAAT TCCTGCAAGT ACGGCGCATT ATTGATCGCT TTCAAGGGCG CGAAGCTTAT GCTGACCTCG ACCGTCGCTG GACAGCCAAA GTAACGGATG TGCGCAACTG GTTTGTTTTT GCCGCCAGCG AACGGTGGCG CGAAGATGAC ATTGAGCACG AACACTACGC CGATTCAGGT GGTAAATCGG GAGGGCAAAA GGAGAAACTC GCCTACACCG TGCTTGCCGC CAGCCTTGCC TATCAATTCG GCTTGGAATG GGGCGCCGTG CGCTCCCGCT CTTTCCGCTT CGTGGTGATT GACGAAGCTT TCGGACGCGG CTCCGACGAA TCGGCACAAT ATGGATTGCA ACTCTTCGCC CAACTTAACC TGCAACTGCT CATTGTTACG CCATTGCAGA AAATCCACAT TATTGAACCC TTTGTTGCCA GCGTTGGCTT TGTGCACAAT CAAGAAGGGC GCTGCTCAGT GTTACGCAAC CTCACCATCG AAGAGTATCG CTCCGAAAAA GAGAGGGCAA TTGCATGA
|
Protein sequence | MSEPLEMGFV ADDRLAGFRL QRLEIFNWGT FDGRVWTLKL GGKNGLLTGD IGSGKSTVVD AVTTLLVPSQ RIAYNKAAGA DNKERTLRSY VLGYYKSERQ ENLGGGAKPV ALRDLNSYSV ILGVFHNEGY DKTVTLAQLF WMKDAAQPAR LYAAYEGDLS IATDFSNFGA EIATLRKRLR GAGVELFESF PPYGAWFRRR FGIENEQALE LFHQTVSLKS VGNLTDFVRL HMLEPFTVEP RIAALIHHFE DLNRAHEAVL KAKRQIEMLA PLVADCDNHH AMAQRTEELR ASRDCLRPWF ASLKLELLEK RHTSLNEELS RHQIAIERLD GERRTLQGRD RELRRTIAEN GGDRIESIAA EIRQHQQELD RRTQKSTRYK ELLRQLGGQL GEHPATSAEE FYQQRAEHAA MHESAAEAEV QVQNNLNEAG VLFTQGRQEY EQLSTEIKSL KARMSNIDEK QIAMRHALCK ALNLPEVEMP FAGELLQVRE EETAWEGAIE RVLRNFGLSL LVPDHHYPKV AEWVERTNLK GKLVYFRVRP QSRNEQLADH PASLARKLAI KADSTYYDWI EREVSHRFDL ICCTTQEEFR REKKAITPAG QIKSPGERHE KDDRHRLDDR SRYVLGWSNA AKIAALEAKA KVQQRELTKL AERISTLQQE QKALKERLTI LSKLDEYSDF NDLDWQPLAV AIARLEKEKE ELEKTSNILQ TLTEQLALLE QELQKTEQQL DDRKDKRSKI EQKISSITEL QQQTAALLAE AGDEVSNRFA LLQAMRQEAF GDQSLTVESC DNREREMREW LQNKIDSEDR KLSRLGEKII RAMTEYKEEW KLETRDVDVN IAAGKEYRAM FEQLQADDLP RFEGRFKELL NENTIREVAN FQSQLARERE TIKERIVRLN ESLTQIDFNP GRYITLEAEN SLDADIRDFQ TELRACTEGT LTGSDDAQYS EAKFLQVRRI IDRFQGREAY ADLDRRWTAK VTDVRNWFVF AASERWREDD IEHEHYADSG GKSGGQKEKL AYTVLAASLA YQFGLEWGAV RSRSFRFVVI DEAFGRGSDE SAQYGLQLFA QLNLQLLIVT PLQKIHIIEP FVASVGFVHN QEGRCSVLRN LTIEEYRSEK ERAIA
|
| |