Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1312 |
Symbol | |
ID | 7268603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 1614918 |
End bp | 1617653 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643566155 |
Product | protein of unknown function DUF87 |
Protein accession | YP_002462656 |
Protein GI | 219848223 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00374873 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000332559 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCAACATG GGTTCGATCA GACTTTTTGG GAACCGAAAA ATAACAAATT TTGGGAACCG AAAAATAACG AATTTATAAC ACCACGCGCG ATTTTTGGAG CAGAGGTCGA TGGGTTGCCT GATTTCTGGG ATGAAGGATT GTCGTTAGAG GAGCAAAACC AGTTGATGCA GAAAGTGATT GAGCGACAGA AAACATTTGT GCATGCACTT AGTTTTTTGC GAGGTACCGC GCAATTCGAA TTGCGTATGG TCTGCGACCC GCACGCACCG CAGCGTTTAC GGCTATTTTA CCTTGTAAGT GTTGAAAACG CTGGTGATCC GGAACTAGTC TGGCGACAAT TTCATAACGC CTTTCCGTAT GATCTCGACT TTCGCTTGCG TATGCTTACA CGTGAAGAGG TGCAGCAGAT TTGTGAACAG CCGTTTCGGG AGCCAATGAG TTTGACATAC TATGAATGGG TTCAAAGCCC GTTTCTAGTT ACGTTAGGCG CTGAAATTGC AAATTGCTGG GTGGCACCTG AATTGAGTAC AGACGGCTCG CACACTTTAC TACGAACACT TCTCCAACAG GCTTGTCCCA TTATGATTGG ATTTACTGTT GCCCCTATCG AAGTCGATGA GAATGTGACC AATGTTCTAG AAAACTGGCA GAAACTATTA GAAAGAGTTG AAATAAATCT CTCAATAACT GTGAACGGTT TACGGGATGA TGACCCTCGA AAGATTTTGG TAGACATTCT CAAAGATAAG TTATATCCTA TGAATAAGCC ATTGCTAGAT TATTGGCTTT ATTTGATAAA TGAGGGAAGT AAGACACCAA AATTTCAACT TGAAATAGTA AAGAATGCTG CGCAGCGTAT AATTCAGTCG AAAGATTCTT TGTTCCGCTG GCGAGTGCAT GCGGCTGTGC CGAGTGATCG GACTATTGAT CCGGTCATTG ATCGCGCTGT TACCGATAGG TTGCGTCCGC GCAGTGCTGG CATAGTTAGC TATCAATGTT ACCGGTGCGA GGTAGATATT GATAAATTGC CGGTCAATAA TGTGCGATTC ATTCGTCTTG ATACACCTGA ACGATCTGAT GATGACCCTG CGCGTCTGAT TATTGATGAT ATTGGTGCCG CAGCATTGTT GCAATTGCCA ATCTTGCCAC CAGGTGGGAT TCCCGGTGTT AGATCATTTC CGGCTAACCC TTTCACTTCG TGGCAGCTTG AAGAAGAAGA TGCTGCTCAT AATAGTATCA AGTATATCAA GATTGGAGAG TATATCGACA GTCGAATTGG TTTGTTGGAC AAATCACGAG TTGCTTCTAT CTCGCTCGAT GATTTAACGA GGCATGTTTT GATAACCGGC TCGACCGGTA GTGGTAAATC AACTACAAGT AAACGGCTTA TCACTGAGCT ACACAGGCAT AAAGTACCGT ATTTGGTTAT TGAACCGGTA AAATCTGAAT ATGGTGATTT GGCTTTTGCC GATGAAATAG CTGATCCTCC CTATCCTGAC TTCTTTGTAC CGGGGAGATT TGATGACCCA ATCTGGTTTA ATCCATTCTA CATCCGAAAG GGGGTTAGCC TAAACACTCA TCTTAGTTAC CTTACCTCGT GTTTCGAAGC AGCTTTTCCT CTCAGTGATG TGCAAAGCAT GCTGCTAAAA GAAGTTCTGT ATGAAGCTTA TCGAGAGAAG TTCGCAGAAA AGTCGTTCTT TATTAGTGAC TCTGTACCAA TAGAGCAGGA TTTAGCTGAT GAAGATGTGC CTTCGCTTGA TGATCTTGTG AAAGCTAAAG AAACGATTGA TAAATTTGGA TATAAAGGTG AACTTAAAAG CAATTTAAAA GCAGCAATTG AGTTACGTTT GCAACATCTC CAAAAGGGGA TTATAGGTAG TATTTTGCAG CCTCGGAATA AGGGATTTCC TTCTTTTGAG AATAGATTAA AGGATATTCT GCAAAAGCCA ACAATCATTC AGCTCAATCA AATTGGCAAC AAAGAAGAGA AAGCACTGAT CATGGCTTTT ATCTTAATGG CAATGTATGA ATATTATGAA CAGCAAACGA ATAGTGAATC ACTTCGCCAC GTAACGTTAA TCGAAGAAGC ACACGTATTA CTGGAAAATG TAACACGAGA GAGCAAGGAA GGTTCAGCTA ATACGCGTGG TAAGGCCATT GAATTGTTTG CTGATATGTT GGCTGAAATT CGTTCACGTG GTGAAGGGTT AGTGATTGTT GAGCAGCTGC CATCAAAGCT TATCCCAGAA GCAATTAAAA ATACGAATCT CAAAATCATG CACCGTCTTA CTGCGCGTGA GGACCGTGAC ATTCTTGGAG CAGCCATGAA TTTCAATGAG CGGCAAAGTC GATTTGCCAC TACGCTACAG CGTGGTCAGG CCATCGTTTT TCGTGAGGGT CTAAGTCAAC CAGCATTGAT TCGGGTGATA CCGATTAAGT TGCAGAGTGA TATACGAAGT GAAGTGGATG AAGTTTTTTT GAGCATGGGA AGGTTTGAAC AAAAAGAGAG AGGTAGAGTA GTGTCAGGGT TCAAGTTTCC ACCAGAACTA TTTGACTGTC TACAGCAAGT GTCAGATGGA AAGAAGAATT TAGAACAACT TTTTGATATA ACTGGTAAAT ATCTTTCGAA CAAATCCAAT AAAAAAATAA AGCCTGATGA TAGAAGAGCT ATTAAATGGT TCCTTTACCG GTTGACTCGA CCATATCCAA AATACTTTAA ATTAATCTTT GGTTAG
|
Protein sequence | MQHGFDQTFW EPKNNKFWEP KNNEFITPRA IFGAEVDGLP DFWDEGLSLE EQNQLMQKVI ERQKTFVHAL SFLRGTAQFE LRMVCDPHAP QRLRLFYLVS VENAGDPELV WRQFHNAFPY DLDFRLRMLT REEVQQICEQ PFREPMSLTY YEWVQSPFLV TLGAEIANCW VAPELSTDGS HTLLRTLLQQ ACPIMIGFTV APIEVDENVT NVLENWQKLL ERVEINLSIT VNGLRDDDPR KILVDILKDK LYPMNKPLLD YWLYLINEGS KTPKFQLEIV KNAAQRIIQS KDSLFRWRVH AAVPSDRTID PVIDRAVTDR LRPRSAGIVS YQCYRCEVDI DKLPVNNVRF IRLDTPERSD DDPARLIIDD IGAAALLQLP ILPPGGIPGV RSFPANPFTS WQLEEEDAAH NSIKYIKIGE YIDSRIGLLD KSRVASISLD DLTRHVLITG STGSGKSTTS KRLITELHRH KVPYLVIEPV KSEYGDLAFA DEIADPPYPD FFVPGRFDDP IWFNPFYIRK GVSLNTHLSY LTSCFEAAFP LSDVQSMLLK EVLYEAYREK FAEKSFFISD SVPIEQDLAD EDVPSLDDLV KAKETIDKFG YKGELKSNLK AAIELRLQHL QKGIIGSILQ PRNKGFPSFE NRLKDILQKP TIIQLNQIGN KEEKALIMAF ILMAMYEYYE QQTNSESLRH VTLIEEAHVL LENVTRESKE GSANTRGKAI ELFADMLAEI RSRGEGLVIV EQLPSKLIPE AIKNTNLKIM HRLTAREDRD ILGAAMNFNE RQSRFATTLQ RGQAIVFREG LSQPALIRVI PIKLQSDIRS EVDEVFLSMG RFEQKERGRV VSGFKFPPEL FDCLQQVSDG KKNLEQLFDI TGKYLSNKSN KKIKPDDRRA IKWFLYRLTR PYPKYFKLIF G
|
| |