Gene Cagg_1487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1487 
Symbol 
ID7267264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1820093 
End bp1823278 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content58% 
IMG OID643566331 
Producthypothetical protein 
Protein accessionYP_002462827 
Protein GI219848394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGGTG ATTTCACCCG ATGGACATTC GACCGTACCA AACACTTCAG CGGTGTACTC 
CATCAGCAAG GGCGTGTTGC GCTCGATGCC GATTGGAACG AGCAACTCGC GATCGATCTC
GCCGATGAGC GCGCAGCTCG CATCGATCTG ATCGGACGTT GCGGCGGCCC GCAGGGCCAA
GCCGGTTTTG ATATCGAGGT TGCCGGCGAC ACGCTCAACA TCACTGCCGG TCGCTACTAT
GTCGCCGGTA TTCGCGTCGA AAACGAGCAG ACGGTTGCCA TCACCGCTCA ACCTGATCTG
CCCGTCGCCA CCTTGGCGCA GGTGGCCGGC TTAGCCGCCA ATGCAACCCT GCCCGCCGGC
GTCTATCTCG CCTATCTCGA TGTCTGGGAG CGGCATGTAA CTGCACTTGA AGATGCTGCT
ATCCGCGAAG TTGCGCTCGG TGGCCCTGAC ACCACGACCC GTATGCAGGT AATGGCGCAG
GTCAAGCTCT TGCGTGTAGC CGATCCCGGC GCAGCCTTTG ATTGCGCTAC GTCAAACGAA
GCATGGGATG CGTTGCTGGC CGGCAGTAGC GGTATGCTCG AAGTGCGCGC CGAACCCGAC
CCAACCGTCA ATGATCCCTG TATCGTGCCG GCCCAAGCCG GCTATCGTGG GTTAGAAAAC
CAGCTCTATC GGGTTGAAGT GCATCGGATC GTCTCGGCCA CCCGTATTGC GCTCAAGTGG
TCGCGGGAAA ACGGTTCGGT GGCGGTTGGC TGGAGCGGGC AAGATCCGCT GGATTCCAAT
CGCTTGATTG TCGCCAGTAC TGGCCGCGAC GAGGTATTGG GTCTCGCGCC CAACCAGTGG
GTCGAGCTGA CCGATGATGC GCGTGACAAG CGGGGCGAAC CGGGTCTACT GGTCAAAGTA
ATACAGGTTG AAGGCAATGT CATCACTATT GATCCAAACG GTCAGACCAT CGTCTACAGT
GCATTCGGTC CCAACCCCAA ACTGCGCCGC TGGGATATGC CCGCTGATAC CGGTGAGCTT
ATCGTCACCA CCAGCGCAGC AACGTGGACC GACCTCGAAC AAGGGGTGCA GATCCGGCTC
AAGAACGGTA CGTTCCGTCC CGGCGACTAC TGGCTCATTC CCGCCCGCAC TGTCACCGGC
GATATCGAGT GGCCGCGCTC TGGCAACCAA CCGGTGCCGC AGCTTCCGCA CGGCATCCAC
CATGCCTATT GCAAATTAGC CCTGCTTGAC TTTAATGGCA GCGTGTGGGC TAAACGCAGT
GATTGCCGTC GACTCTTCCC TCCCTTGACT GAGTTAATCC GTTTCTATGC AGTCGGTGGT
GATGGTCAAG AGGCATTGCC CGGCGCCACC CTTGCCCAAC CGCTGCAAGT GGCCGTCACC
AACGGCCAGC AACCGGTCAC CAATGCCCGT GTGCGCTTTC GGATCGTGCC CCAAACTGCT
GCCGGCCAAC TCACTGCCGG CGGCGCCACC GGCAAGAGCG TTGATGTGAC TGCCGGCCCA
AACGGTATCT ACTCCTGCCA GTGGCAACTC GGCCCCGCCG TCCAGAGCCA GCGCGTCGAA
GCCTTCTTGG TCGAGATCGA CGGCAAACCG TTTGTCGATA GCGCCGGCGA ACCGTTGCTG
CCGCGCGTCT TCTTCAACGC CAACCTCAGC CGCGCCAGCA ACGTGGCTTA CCAACCCGGT
ACGTGCTCCG ATCTAGCCAA TGCCAGCACC GTGCAAGAGG CGCTCGACAT CCTTTGCCGG
CGGCCCCGTG GCGGTGGCTG TTGTGTAAGC GTCGGCGACG GCGGCGAGTT TCCCAATCTT
GAAACGGCCC TCAAAGAGTT GCTCGCTCGC GGCGAGCGTC GCATCTGCCT CTGCTTGCGC
TCCGGCCGTC ACGCTGCCGG GAGTATCCAG CTCGATCTCC CGCCCGCCAA CGAACCGTTC
GTGCTTGAAA TCAAGGGCTG TGGCCCGGCC ACCAGTGTGC GCTTTGGTGG CCCGCTGCGG
TTAAGAGGGT TTGATTCGGT GGTATTGCGC AATCTCTCCA TCGACATCGC TTTCATACCT
GAATCCGGCA CGGCAGCCCT CTCGCTTGAT CGTTGCCTAC ACGTGATCAT CGCCGATTGT
GCGATCAGTG GCCTCACTCT GCCGGGCCAG CTCAACCAAG AGCAATTCAT TGATGGCAGC
GCCCTCGTTG CCATCACCGG CAGTGATGAT GTACGGCTAA GCGGCAATCT GTTCGTGGCT
GCCGTGCCGA GTCGCACTTT CTCGCTCCTT CTTGAGTTAT TCGGCGAAGC CCGGCTTGAC
GTTTTTGCTA AGCTCTTCGT CGACGACGAA CGGATCATGA CCGTAGAATG GCGTCAATTG
GCACTTGATG TTGTAACGGA GATAGTCCAA CTCAATCCCG AACAGCGCCA ACAATTTGCT
CGTCAACTCC AAGCACAGGT GCAAGAGCGG GTAGCTGCTC TGACCTTTGT TGAGCTTCTC
CAACTCCAGA AACTCCTCTT GGCTCTCAAC CAAGCCGAAT CCAATGCAAG TGCCTTGCTC
GACATTCTCT TCGACCTACG TGCCGCCGCG GTCAAATCTC GTCCCGGTAC AGCGCTGATG
CTTCACGAAC GACGTTCGTT CAAGGAGCAG AATCTTGGCA AGATCATCGA TCTGGTTGAT
GAAGACGATC TGGTGGTGCT CGAACATAAT CGGTTTGATG GCATTGTCAG CCTCTACGGT
ATGCCTGCCC CTCTTGAGCT GGTTGTGTCA TTCGCCGATA AACTTCAAAA TCCCGATCAA
CAGACGAATA TCAATGAAGC CTATGTGGGT ACCGTTCAGC TTCGTAGCAA CCATCTGGTC
AGGCTGAGCA TTAGCTTTGA TTTGTTGAAC GAGATTGTCC GCGATAGTGA TAGGACTGTT
GTCTTCGATG TCTGCGCGCG CTTGTTGCTC GACGGCAATG TGATTGAGGG TATCGCCAAT
CTCACCCTCA GCCAGCATCT GATTGCCCAA GCCAATTCGT TCACTGCCAT GGCTTCACTG
CGCAGCAGCA CAAGTCCTGC CGTCACAAGT GCCGGTACTG TGGCTTCGCC GGTCTTGGGT
TGGTGTCTTG CTAATGCCGC AGCCTTCATC GGTAATCAGG GGTCGGCCCA ATCGCAAAAT
GGACGTCTGT TTGCTATATC GCCTATCACC GAACGGGTGG CAAACGTATT GATCCAAATC
ATATAA
 
Protein sequence
MRGDFTRWTF DRTKHFSGVL HQQGRVALDA DWNEQLAIDL ADERAARIDL IGRCGGPQGQ 
AGFDIEVAGD TLNITAGRYY VAGIRVENEQ TVAITAQPDL PVATLAQVAG LAANATLPAG
VYLAYLDVWE RHVTALEDAA IREVALGGPD TTTRMQVMAQ VKLLRVADPG AAFDCATSNE
AWDALLAGSS GMLEVRAEPD PTVNDPCIVP AQAGYRGLEN QLYRVEVHRI VSATRIALKW
SRENGSVAVG WSGQDPLDSN RLIVASTGRD EVLGLAPNQW VELTDDARDK RGEPGLLVKV
IQVEGNVITI DPNGQTIVYS AFGPNPKLRR WDMPADTGEL IVTTSAATWT DLEQGVQIRL
KNGTFRPGDY WLIPARTVTG DIEWPRSGNQ PVPQLPHGIH HAYCKLALLD FNGSVWAKRS
DCRRLFPPLT ELIRFYAVGG DGQEALPGAT LAQPLQVAVT NGQQPVTNAR VRFRIVPQTA
AGQLTAGGAT GKSVDVTAGP NGIYSCQWQL GPAVQSQRVE AFLVEIDGKP FVDSAGEPLL
PRVFFNANLS RASNVAYQPG TCSDLANAST VQEALDILCR RPRGGGCCVS VGDGGEFPNL
ETALKELLAR GERRICLCLR SGRHAAGSIQ LDLPPANEPF VLEIKGCGPA TSVRFGGPLR
LRGFDSVVLR NLSIDIAFIP ESGTAALSLD RCLHVIIADC AISGLTLPGQ LNQEQFIDGS
ALVAITGSDD VRLSGNLFVA AVPSRTFSLL LELFGEARLD VFAKLFVDDE RIMTVEWRQL
ALDVVTEIVQ LNPEQRQQFA RQLQAQVQER VAALTFVELL QLQKLLLALN QAESNASALL
DILFDLRAAA VKSRPGTALM LHERRSFKEQ NLGKIIDLVD EDDLVVLEHN RFDGIVSLYG
MPAPLELVVS FADKLQNPDQ QTNINEAYVG TVQLRSNHLV RLSISFDLLN EIVRDSDRTV
VFDVCARLLL DGNVIEGIAN LTLSQHLIAQ ANSFTAMASL RSSTSPAVTS AGTVASPVLG
WCLANAAAFI GNQGSAQSQN GRLFAISPIT ERVANVLIQI I