Gene Cag_0486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0486 
Symbol 
ID3746355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp565456 
End bp568353 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content43% 
IMG OID637773020 
ProductTPR repeat-containing protein 
Protein accessionYP_378802 
Protein GI78188464 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGATA ATAATTCAAC ACATATTCAG CCTGCTGGCG GTTTTGCGGC TATCAGCAAA 
TACAATCCGC ATCTTTGGAG TGCGGAACAA TTACGAGCTA TTTTTGTTGC ACGAACAAAC
GAATTAGCTG ATTTGGTGCA GACGTTGCGC ATGGTACAAC CTGATACAGT AGCCCAACAT
GTTTTGTTAG TTGGTGCGCG TGGAATGGGT AAATCCACTC TTATGCGACG TTTAGCATTA
GCGGTGGAAG ATGATCCTTC GCTTTCTGCT AATTGGTTGC CGTTGCGGTT TCCCGAAGAG
CAATACACCG TTGCTACACT TGGTCAATTT TGGGCTAACG TTCTTGACAG TTTTGCCGAT
ACGTTACAAC ATCTTGGCGA ATCTGTTATA GCGTTGGATG CTGCTGCTGA GCGTATTGCA
GCGTTACCTG TAACCCAACA GCCCGAAGCC TATATAGATG CCATTAATCA CTTTGCTGAT
GAACGTAAGC AACGGTTGTT ACTTTTGGTG GATAACACGG ATATGCTGCT CCATAACATT
GGGAAAGATG CTCATTGGGG ATTACGTGCA ACGTTGCAGA GCAATCCTCG CCTTTTCTGG
ATTGGTGGTA GTTACCAAAG TCTTGAAGCT GAGAGCAATT ATCACGATGC TTTTCTTGAC
TTTTTTCGAG TTATCAACTT GCGCCCGTTA AAAGTGGAAG AAATGCGTCA AGCGTTGCTT
GCTTTAGCTG AAACCTTTGG TGGTGCTACA GCACGAAATG CAATGGTGCA CCAACTTGAT
CTTCAGCCAG AGCGCTTGCC CACTTTGCGC CAACTCTCAG GTGGTAATCC ACGTACAACG
GTGATGCTGT ATGAAATTCT TGCCAATGGT CAAAATGGCA ATGTACGTAG CGATTTAGAA
GCGCTGCTTG ATAATATGAC GCCTCTTTAT AAAGCTCGTA TGGATAGCTT AGCTGATTTA
CAACGCAAAC TGCTTGCCCA TATTTTAGAG CATTGGGCAC CTATCTCTTT TGGCGAATTG
GCTGCGGTTT CTCAAGTAGC AAAAGGCACC ATTAGCCCAC AGTTGCAGCG TCTTGAAATA
GAAGGACTTA TTGAAAAAAC TTCTTTGCAT GGTACTACTC GTAGCGGCTA CCAAGCGGCT
GAACGCTTTT TTAACATTTG GTATTTAATG CGCTTTTCAC CACGTCGGCA GCGCAATCGT
CTTGTTTGGT TGGTGGAGTT TATGCGCTTA TGGTTTAGTG GCGATGAACT CTGCTCTTTA
GCAAAACAAC GAATGAGTGT TGGTAGCAAT GATTTGCGTT CAACGCATGA CTTGGAATAT
GATCGAGCAT TAGCTGATGC TCTACCGCAG TTTGCGCCCG AACGCCACGC TTTGCGCTGG
TCGTTGCTCA AGCTGTTACA AGAAAACAAC TCTCAGTTAG TGGAGCTTTT CGATTTTGAC
GGTGAGGATA AAGAGTTTAA AGGCGCTACT GATTATCTAC GCCGTTTAGC TGCACTACCG
TCACTGTTAC GCCAATGCCC TCATGCGGTA ACTGAGCAAG AGAAAACGCA TTGGGTTGAA
ACTGTGCTTG GTTCGATTAG CTTAACTTTG GAAGAGAAAG AAATTGTAGC TCAAAAGGCA
GAATATTTAA CTCTTTTTCA ATATGACGAG TTATTAAAGG TTTTTAGTGA AGAACAAAAA
AGATGGGAAA AACAGTTTGG GGTGGCTGCT TTGCAAACGG TACGTAGTGC TGTGTTGAGC
CAAGATTTTT TTTCTGATAT GCCTGATTCA CATCTTGCTT ATGAGCAAGT TCGTGTCTGT
TTTGCTAACA ATAAAGAAGC GTTACGTTTT GTTAGTTTAC TTTTTTATAG CAAGCATAAA
GACGAGTGGA GCTACAAAGC CCAGAAATTA GCACTGAACC TATTGCACGA TGATTCTAAA
AGTGATTGGT TCCTTCGAGA AAAGCTTGTG CGTTATGAAG AAACCGAAAA AGCTTATTGC
AAAGCAATTG AATTTAATAA GGAAGATGCT GTTACTTGGA ATCACGTAGG TAACTTACTA
AAAGATTATC TCGGTCGTTA TGAAGAAGCA GAAGCAGCGT ATCGTCAAGC GATAGCTATA
GATAAGAAGT TTGCTTATCC ATGGAATAAT TTGGGTCAGT TATTGCATTA CAATTTGAAT
CGCTATGAAG AATCGGAAGC AGCGTATCGT CAAGCAATAG CATTGGATGA GAAGTATGCT
TATCCATGGT TTAACTTGGG GCAACTACTG CACTATAAGC TCGAACGTTA TGAAGAATCG
GAAGCGGCGT ATCGTCAAGC GATAGCTATA GATGAGAATA ATGCTTATCC ATGGAATAAT
TTAGGTCAGT TATTACATGA ATGGTTAGGT CGTTATGAGG AAGCAGAAAC AGCGTATCGT
CAAGCGATCG CGTTAGATGA GAAATACGTT TATCCTGTAA CAAATCTTGC TCGCTTGTTG
GCTCAGCGTA ATCGCAAGGC AGAAGCCGAA ACCTATTATC GTGAAGCTGT CTTAAAAGAC
ACTCAAGACA CTCAACAATT ATTCCTTCAA GCGCATCTTT TCTTAGGAAA TCGTCAGTTA
GCTATGGACG CATTGCAAGC CTTAGCTGAA AAAGCGCAAA ATGGTAATCA ATATGCGTTT
TACCGTCTCA AAGAACAGGT TTGGGAGTGC TATGAGCTTG GGTTAGGTGA ACGCTTAGCA
GATTGGATGG CAGAAAGTAA TGTGGCAGAG TTCCTTACGC CATTTATTCA AGCTCTTTAC
ACGCTTGCAG GTGTTAATGA GAAATTGCGC GACTTACCAA TGGAAAGCCA ACACATGGTA
GATGAGATTG TTCGCAAAGC GCGTTTGCGG CAGGAAAAAC GAGAGGCTTG CAACATGCGG
GCAAAATCAA TTCATTGA
 
Protein sequence
MLDNNSTHIQ PAGGFAAISK YNPHLWSAEQ LRAIFVARTN ELADLVQTLR MVQPDTVAQH 
VLLVGARGMG KSTLMRRLAL AVEDDPSLSA NWLPLRFPEE QYTVATLGQF WANVLDSFAD
TLQHLGESVI ALDAAAERIA ALPVTQQPEA YIDAINHFAD ERKQRLLLLV DNTDMLLHNI
GKDAHWGLRA TLQSNPRLFW IGGSYQSLEA ESNYHDAFLD FFRVINLRPL KVEEMRQALL
ALAETFGGAT ARNAMVHQLD LQPERLPTLR QLSGGNPRTT VMLYEILANG QNGNVRSDLE
ALLDNMTPLY KARMDSLADL QRKLLAHILE HWAPISFGEL AAVSQVAKGT ISPQLQRLEI
EGLIEKTSLH GTTRSGYQAA ERFFNIWYLM RFSPRRQRNR LVWLVEFMRL WFSGDELCSL
AKQRMSVGSN DLRSTHDLEY DRALADALPQ FAPERHALRW SLLKLLQENN SQLVELFDFD
GEDKEFKGAT DYLRRLAALP SLLRQCPHAV TEQEKTHWVE TVLGSISLTL EEKEIVAQKA
EYLTLFQYDE LLKVFSEEQK RWEKQFGVAA LQTVRSAVLS QDFFSDMPDS HLAYEQVRVC
FANNKEALRF VSLLFYSKHK DEWSYKAQKL ALNLLHDDSK SDWFLREKLV RYEETEKAYC
KAIEFNKEDA VTWNHVGNLL KDYLGRYEEA EAAYRQAIAI DKKFAYPWNN LGQLLHYNLN
RYEESEAAYR QAIALDEKYA YPWFNLGQLL HYKLERYEES EAAYRQAIAI DENNAYPWNN
LGQLLHEWLG RYEEAETAYR QAIALDEKYV YPVTNLARLL AQRNRKAEAE TYYREAVLKD
TQDTQQLFLQ AHLFLGNRQL AMDALQALAE KAQNGNQYAF YRLKEQVWEC YELGLGERLA
DWMAESNVAE FLTPFIQALY TLAGVNEKLR DLPMESQHMV DEIVRKARLR QEKREACNMR
AKSIH