Gene Cagg_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3004 
Symbol 
ID7266535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3670445 
End bp3671797 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content52% 
IMG OID643567826 
Productpreprotein translocase, SecY subunit 
Protein accessionYP_002464300 
Protein GI219849867 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.339766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.010011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAAG CTGTACTGCG TGCAATTCAG TTGCCTGATC TCCGGCGCCG TATCCTCTTT 
ACTCTGGGGA TGCTGTTCCT GTTTCGCCTG ATCGCGCATA TTCCGGTACC CAATATTAAT
CCGACAACGC TCGAACAGTT GCGACAGGCG CTGGCGACCA ATCAATTGGC GCAACTGCTG
AATCTGTTCG CCGGTGGTGC GTTGGAAAAC TTCTCGGTGG CTGCGATGGG GGTGTATCCC
TACATCACCG CCCAGATTAT TATGCAGTTG TTGCAACCAC TGATTCCCGC GTTGCAAGAA
TTGCAAAAAG AGGGTGAACA GGGTCGTCTA CGCCTCAATC GCTACCAATT GTGGTTAACG
ATCCCGTTGG CGTATTTGCA GGCCTACGGC CAAACCCTGA CACTTGAACG AACTATCAAC
CCGAATAACG ATCCGACCGC ATCACTCTTC CGAACACCGT TCGATCTGGC GGCCAACTTT
CTCCCTACAT TTACCGTCCT TACCACGATG GTCGCCGGCA CGATGTTACT GATCTGGCTC
GGTGAACAGA TTAACGAGCG CGGGATCGGT AACGGTATCT CGATCATTAT CTTCGGCGGA
ATTGTTTCAC GGCTGCCGGG TCTCATTATT CAGGGCTTCC AGATTAGCCA GGCCGGTGAT
TTTAGCACCA TTCTTGGCCT CATCGGTTTT GTTGTCATTG CATTGCTGAC AATTGTCGGT
ATTGTGCTCA TCCAAGAGGG GCAGCGCCGG ATTCGCGTTC AATATGCACG GCGGGTCCGC
GGCAATAAAG TCTACGGCGG TCAGAGCAGC TTCATTCCGC TCAAGGTCAA TTCGGCCGGC
ATGATACCGC TGATTTTTGC CCAGAGTATT ATCATCTTCC CCGGTATCGT TGCCTCATGG
TTTTACCGGC CCGGTGCCGA GGGAATTGGT AATGCTATCG CCGGTTTCTT TTACAATACG
TTTAACCCAA CCGGTCAGGG CGGTGGGATT GTCTATATGA CGTTGCTCTT CCTCCTGACT
GTCGGTTTTA CCTATTTCTA CACCATTGTC ATCTTTACCC AACAGGATTT GGCCGAGAAC
CTCCAGCGCA ACGGTGGTTT CATTCCCGGC ATTCGGCCAG GTAAAAAGAC GGAAGAGTAT
CTGATGAAGG TACTTAACCG CATTACATTG GCCGGCGCAC TGTTTCTTGG CTTGATTGCA
GTGCTACCAT TCATCACCCA ATCGATTACC GGTGTGCAAA TTGGGTTGGG TAGCACGGCT
CTCCTGATCG TGGTGGGTGT GGCGGTGGAT ACGATGCGCC AACTCGAGGC ACAGTTGATC
ATGCGTGACT ACGAGGGTTT CTTGACCAGA TAG
 
Protein sequence
MLQAVLRAIQ LPDLRRRILF TLGMLFLFRL IAHIPVPNIN PTTLEQLRQA LATNQLAQLL 
NLFAGGALEN FSVAAMGVYP YITAQIIMQL LQPLIPALQE LQKEGEQGRL RLNRYQLWLT
IPLAYLQAYG QTLTLERTIN PNNDPTASLF RTPFDLAANF LPTFTVLTTM VAGTMLLIWL
GEQINERGIG NGISIIIFGG IVSRLPGLII QGFQISQAGD FSTILGLIGF VVIALLTIVG
IVLIQEGQRR IRVQYARRVR GNKVYGGQSS FIPLKVNSAG MIPLIFAQSI IIFPGIVASW
FYRPGAEGIG NAIAGFFYNT FNPTGQGGGI VYMTLLFLLT VGFTYFYTIV IFTQQDLAEN
LQRNGGFIPG IRPGKKTEEY LMKVLNRITL AGALFLGLIA VLPFITQSIT GVQIGLGSTA
LLIVVGVAVD TMRQLEAQLI MRDYEGFLTR