Gene Cagg_3040 details

Gene Information

Locus tag: Cagg_3040
Symbol:
ID: 7266571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3695169
End bp: 3696287
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 58%
IMG OID: 643567860
Product: hypothetical protein
Protein accession: YP_002464334
Protein GI: 219849901
COG category:
COG ID:
TIGRFAM ID:
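
The start and end coordinates, gene length and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows; it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
# Values copied from the record above.
START_BP = 3695169
END_BP = 3696287


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1119
print(protein_length(gene_length(START_BP, END_BP)))  # 372
```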


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000024541
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTACCGTC TTATCCTGCT CGCACTTGCC TTGCTGGCGC TTATCAGTGG ACAATTCTTA 
CAACCACGGC TCCCACGCCC GCTCAACGTG CCACCACCAC GCGCAGTTAT CACCACAAAT
CCGTTAATCG GTGTCCATAC GCGCCTGACC GGGATCGGTG ATGAAGCCTA CATTCGCCAA
ACGCTTGCGC AAGTCAACGA GATGGGAGCA AGCTGGATTG TAGAACTCTT TCCATGGGCC
TACATTCAGC CGCGTTCCCG TTATGGCTTC GATTGGACAG GGGCCGATAT GGTGATTGCC
CATGCCCGTG CTCAAGGCCT ACAAGTCGTT GCGCGGCTCG ACATTGTACC GGCTTGGGCG
CGTCCACCCA ATACCACCGA CCGCTATCTC GATCGCGACC ACTTTGCCGA TTTCGCACGC
TTTGCTGCTG TATTTGCCGC TCGGTACGCA CCGCAAGGAG TACGCCATCT TGTGATTTGG
AACGAGCCGA ATCTGCGTTT TGAGTGGGGT GAACGTCCAC CTGATCCCGG TGCATATACC
GATCTGCTCA AACAGACCTA TCCGGCGGTC AAGGCGGTTG CCCCAGAGAC CATCGTGATC
GCCGGAGCAC TTTCCCCCGG TCCCGGTCTT GAAGGAGGGA ATCTGCGCAT GGACGATCTG
CAATTTCTCG CTTCACTCGC CGATGCCGGC GCATTTCCCT TTTTCGATAT GTGGGCCGTT
CATGCCTACG GCGGCCTTGA ACCGCCAGAA ACCGACCCCG CACCCGACCG GGTTAATTTT
CGACGCATCG AGCTGGTGCG CGAACTGCTT GACCGACTCG GCGGCTCTGA CAAACGGATC
ATCATTACTG AAGGTGGCTA TAACGACCAC CCACGCTGGA GTGGTGCTGT GCGCCCCGCC
GACCGTGTAC GCTGGACGAT TGCCACCTAC GAATGGTCAC GCCGATACCC GTGGCTAGAA
GCGACCATTC TTTGGCAATT CAGTACACCG TTCCGTACTC GCTCATACCC CGATGCCTGG
AACTTCGTCG ACCCTGACGG CACACCTCGC GCTATTTATC TGGCCGTGCA AGAGTATGCC
CGTACCGGTA AACTGCCGGA GCCGATGAGC CGGCCCTAG
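
The gene sequence above is printed in blocks of 10 bases, 60 per row. A minimal sketch of how the spacing might be removed and a GC percentage computed is given here. The helper names are hypothetical, and how the database itself derives and rounds its GC content figure is not stated in this record, so the calculation is only an assumed, conventional one.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTACCGTC TTATCCTGCT CGCACTTGCC TTGCTGGCGC TTATCAGTGG ACAATTCTTA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full sequence into clean_sequence should give a string of 1119 bases if the rows were copied intact.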
 
Protein sequence
MYRLILLALA LLALISGQFL QPRLPRPLNV PPPRAVITTN PLIGVHTRLT GIGDEAYIRQ 
TLAQVNEMGA SWIVELFPWA YIQPRSRYGF DWTGADMVIA HARAQGLQVV ARLDIVPAWA
RPPNTTDRYL DRDHFADFAR FAAVFAARYA PQGVRHLVIW NEPNLRFEWG ERPPDPGAYT
DLLKQTYPAV KAVAPETIVI AGALSPGPGL EGGNLRMDDL QFLASLADAG AFPFFDMWAV
HAYGGLEPPE TDPAPDRVNF RRIELVRELL DRLGGSDKRI IITEGGYNDH PRWSGAVRPA
DRVRWTIATY EWSRRYPWLE ATILWQFSTP FRTRSYPDAW NFVDPDGTPR AIYLAVQEYA
RTGKLPEPMS RP
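
The protein sequence can be cross-checked against the gene sequence by translating the codons. The sketch below is one way to do that in plain Python, assuming the gene sequence is read in frame from its first base. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The function names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
print(translate(clean_sequence("ATGTACCGTC TTATCCTGCT CGCACTTGCC")))  # MYRLILLALA
```

The output matches the first block of the protein sequence above. Applying the same call to the whole gene sequence should reproduce all 372 residues.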