Gene Cagg_2400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2400 
Symbol 
ID7267228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2917332 
End bp2918894 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content57% 
IMG OID643567226 
ProductTPR repeat-containing protein 
Protein accessionYP_002463709 
Protein GI219849276 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.38799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCGA CCAACGAGCT TCAATTTATG CCCCTCGACC TCGATACCTT CAGCGGTAGC 
GAGCGCTTCA TGAGCGGCAC CCGCCTAGGG GCAGCGTTCA GTCAAGGAAT GCGTTCCTAC
CTACGCGCGG CCTACGCCGA TGCGATCGAG CATTTCAAGG CTGCTCTCAT CGCCGCCTAC
GTTGATGGGG AAGAGCAAGC CCAGATTTAC GAGCGCGAAC GGGCAATTAT CTATCTCTAT
ATCGGCAACT CACTCGCCTT CCAAGATCAT TGGGAAGAGG CACTGCGCGA GTATCTCGAA
GCGGTACAGA CCGATCCGCA ATTAGCTGAG GCCCACTACA ATCTCGGCGT CGCCTTTGCC
GCCCTCGGCC AAATCGACCG AGCCATCGCC GCCTTCAAAG AGACGCTCGA ACACAACCCC
AACCTCTACG AGGCCCATTT CGCGCTTGGT CGCTGCTACC AACGCATCGA CGATGCCGGT
CGGGCGTATA TTCACTTCAG TAGCGCCTGC AACGCGCGCC CACAAGCTGC CGAGCCACGT
TACTACATGG GCTTGATGCA CCAAAGTCAC GGCGCGCACG AGCTTGCACA ACGGTGCTTT
GCCGAAGCGC TGCGCGTTGA ACCGACGTTT GTCTCACCCG AACCATTACC CGACGAGCCG
CTCGTCAACC GTAGCGAAGA AGAGGTGGCC CAGTGGTATT ACCGGCTTAG CCAGGCCCTC
AAGCAACAAG GATATGAAGA AGAGGCCGAG CGGATCTACC GTGCTTTACT CCAATGGCGC
CCCCAAGAGT ATGCTGCCCG TTATCTCCTG GGGAACCTCC TGGCCCGTCA ACGTCGGTTT
GACGAAGCCT ACGCCGAATA TGAGCAAATA CCACCGCAAC ATCGCCATTA TGTCGATGCA
CGTCTCCGCA TGAGCGCCAT CTTGCGTTTG CAAAAGAAGC CCAAACAGGC CTACGAAATC
CTTTTCGCCT GCGCCCGGCT TAATCCGCAC CACGGCCAGC TCTTCTTGCA GATGGGCAAA
CTCCTTTACG ATATGGGAAT GACCAGGCAA GCGGTCCGTG CCTTCGAGCG AGCCGTTCAA
TTGCTCCCGA CCGATGCTCA AGCTCATTAC CTCCTTGGAT TTGTGTACAA TACGATGGGG
CGCGACACGT GGGCACTCGC TGCTTGGCGC AAAGCCGTGC AGCTTGCCCC TGATGCGCAT
TCACTACGCT TTGATCTTGG CTACATGTAT ATCCGGCGTG GGCGATACGA TCTTGCAGCG
AAAGAGTTCC AGCAAGTACT CGAACAGTGG CCCGACGATA TAGAAACGCA GTTCATGCTC
GGATTATGCT ACAAAGAGCT GCTCGAACCG TCGCGCGCTA TCCCACTGTT TGAAAAGGTG
CTTCGCCGCA ATCCACGTCA CGCTCAGGCT CTCTACTACC TGGGTGCATG CTACCTCCAA
GTCGGCAACA CCTCTCTCGG CAAGGCGTAC CTGCGTCGGT ACGATCATCT TATCCGTCAA
ACCGAAACAA CGAACGGCAG TCGGTCGCGA TCACTACCCA AACCACAGTT GTCTTCACCA
TAA
 
Protein sequence
MASTNELQFM PLDLDTFSGS ERFMSGTRLG AAFSQGMRSY LRAAYADAIE HFKAALIAAY 
VDGEEQAQIY ERERAIIYLY IGNSLAFQDH WEEALREYLE AVQTDPQLAE AHYNLGVAFA
ALGQIDRAIA AFKETLEHNP NLYEAHFALG RCYQRIDDAG RAYIHFSSAC NARPQAAEPR
YYMGLMHQSH GAHELAQRCF AEALRVEPTF VSPEPLPDEP LVNRSEEEVA QWYYRLSQAL
KQQGYEEEAE RIYRALLQWR PQEYAARYLL GNLLARQRRF DEAYAEYEQI PPQHRHYVDA
RLRMSAILRL QKKPKQAYEI LFACARLNPH HGQLFLQMGK LLYDMGMTRQ AVRAFERAVQ
LLPTDAQAHY LLGFVYNTMG RDTWALAAWR KAVQLAPDAH SLRFDLGYMY IRRGRYDLAA
KEFQQVLEQW PDDIETQFML GLCYKELLEP SRAIPLFEKV LRRNPRHAQA LYYLGACYLQ
VGNTSLGKAY LRRYDHLIRQ TETTNGSRSR SLPKPQLSSP