Gene Cagg_1354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1354 
Symbol 
ID7268646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1678391 
End bp1679704 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content54% 
IMG OID643566197 
ProductWD-40 repeat protein 
Protein accessionYP_002462697 
Protein GI219848264 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0105276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000574581 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGCACC GTTCTTCCTT CAACGAACCC ATTGAGTCGT CAGCTCCTGC GTCGTCAGGG 
TTGGAACATC TGCGACAACA ATTGCTGATA GCATATGATG GCGCATGTGC AATTAGTGGT
TGTCAAGTTG ATGCTGTCTT GCGTCCAGTT TTGATCGATC CTAATGGGCC GGCTGAGCCG
TATAATGCGT TGTTGTTGCG GGCCGATCTC CAGCAGCTTT TTCAATCCGG TCTGCTGACC
ATCGATGCCA TGACCCTCAA AGTGCTTGTG GCGCCGGCGT TGCAGAATAG TGAATACAGA
GCACTGGCAG GTAAACGGCT CCGTCAGCCG AAACGGTTGC CCCTTCGACC AAGCCGACAT
ATGTTGGCTG CTCATCATCG CCTTTTTCAA CTTGAGCGAC CGTCTCTTGA CTCTTGTACG
GCGCGACCGC CTTCTCTACA GCGATTATTG GCCGTGCAGA GTTGGGTGAA AACGCTGGCA
TTTAGTCCCG ATCAGCAGAC CTTGGCGACC GGTTCTCTGG ATGGGAAACT ACGGCTTTGG
CGGTGGTCTG ATGGTCAATT GCAGCGCGTG CTGAGCAGTA GGATTGATGA AATCAATGCA
GTGGCTTTTA GTCCTGATGG CCAACGGATT GCGGCTGCAG GTCGTCAGGA TGGGGTACAG
GTCTGGCGAG TAGCCGATGG AGAACCGCTC CTGTATCTCC GTAACGACCA ACGCCATGGA
GCACTTTTTA GTGTAGCTTT TCAGCCCAAT GGTGATCTGA TCGCGGCTAC CGGCTGGGCA
CCGGTTATCT GGCTGTGGAA TGCGACTGAT GGCAGCGTAA GTGGGGGCTT ATCCGGTCAC
GAAGGCTTCA TCAATAGTTT GGCATTCCAC CCAGGTGGCG ACTTGCTCTT ATCGGGTGGC
CAAGACCGGA TTGTCCGACT CTGGCGTATC CCCGATCGAT CGTTGGTTCG TGAGATGCAT
GGTCACGATG ACGAGATTCT CAGTGTTGCA TTTAGCGCTG ATGGCGAATT AGCCGCCAGT
GCAAGTGCTG ATGGGGTGAT TATTGTCTGG CAGGTCGCTC ATTGGCAACC GGTGCAGATG
TTGCCTTCCT ATGCTGGAGC GTGTTCGAGT CTTGCGTTTA GTCCTGATAA TCGGTATTTG
GCGAGCGCTC ATGATGGTCG GACTGTGCTC ATGTGGCAGG TAAGTAATGG AGAACTGCGT
TGGGAACTGC GAGGTCATGG CGAACGTGTG ACGTGTGTGG CATTTGCACC GCGCGGGAAT
GTCCTGGCGA GCGGGAGTTT TGATGCGGTA GTGCGAATTT GGGCGTATAA GTAA
 
Protein sequence
MQHRSSFNEP IESSAPASSG LEHLRQQLLI AYDGACAISG CQVDAVLRPV LIDPNGPAEP 
YNALLLRADL QQLFQSGLLT IDAMTLKVLV APALQNSEYR ALAGKRLRQP KRLPLRPSRH
MLAAHHRLFQ LERPSLDSCT ARPPSLQRLL AVQSWVKTLA FSPDQQTLAT GSLDGKLRLW
RWSDGQLQRV LSSRIDEINA VAFSPDGQRI AAAGRQDGVQ VWRVADGEPL LYLRNDQRHG
ALFSVAFQPN GDLIAATGWA PVIWLWNATD GSVSGGLSGH EGFINSLAFH PGGDLLLSGG
QDRIVRLWRI PDRSLVREMH GHDDEILSVA FSADGELAAS ASADGVIIVW QVAHWQPVQM
LPSYAGACSS LAFSPDNRYL ASAHDGRTVL MWQVSNGELR WELRGHGERV TCVAFAPRGN
VLASGSFDAV VRIWAYK