Gene Cagg_0056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0056 
Symbol 
ID7269053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp77590 
End bp80133 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content54% 
IMG OID643564929 
Productvon Willebrand factor type A 
Protein accessionYP_002461445 
Protein GI219847012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.854643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.269901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGACC GTCGTATCCG TACCTACCGT CTATCACGGC GAACACGGGG GCAGAGCATC 
CCACTCCTTG CCCTCATGAT CGTCGTGCTG ATCGGGATGG TAGCATTATC GGTGGATGTA
GGGCGCACCT TCTCAGAAGA GCGCCGCGCC GTCGCCGCCG CGAACGCCGC TTCACTATCG
GCGATGAATA CCTATGTTCG CCGGCCCGCC GGTACAACCA ATAAAGTTAT CTATGACTCC
ATTGTGAATT CGCTGCGTTC AAACGGGATC GATATCGAAA ATAATCCGAA TATTCGGATG
GAGGCCTATT ACCTCAACGG CCGTGGTGAG CCGATTGAGG GCGGTGCCCG TATCAATCCC
GATGGTACGG TGGCACCTGA TAACGTGGCG TACATTCAAG TCAACCTCGA AGGTGACGTC
GATACCTTCT TTGCCCGCGT TGTGAATCAG AATCAGTTAC CGATTGGTGC TACTGCATAC
GCCGGTACCT GCCCGCCCAC CGATGGCGTC TATCCGATTG CGGTGAATAA TGAGTACATC
AGTGGTAATG AGTTCCGTAA TCCGGGTGAC GCAAATGGTG ACGGCAAGCC TGATAATAAC
TGGCAAAAAC TGACCAGTGG GCCCTACAAA GGCTTTACCA AGATGCGCCT GTACCCGACC
GATGGTAACC TGCCCGGTCA GTTTGGATGG TTACGCTGGC TTGACGGTCG TGGTGCGAGT
GGTGCAAATG CCAACAGCAA CCAAGAGCTA GAGTTAGCGT TAACCGGTAC CGGTTCACTT
TCCAAGGGCT TTATGGAGGT CGTTCCATGG CCGGCTACCA ATCTTCCCCG TCCGGCTAGT
TATCCCGAAC GACCCGGTGA GTTAAATGTT GGTGACTGGG TTTACGGTAG CTCTGGCTAC
AACAACAGTG TCGGTGTGCG CAACGCACTT GACGCTCACA TTGCTGCCGG GACGCGGATG
GTACTGCCGA TCTACGATAT TGCTGTTGGT CAAGGATCGA ATGCTGCCTT CCGGGTTGTG
CGCTTCGGGT TGTTTGTGCT GACTGCATAT GGGCAGGAGC GGGGTAAGCC TTATCTCGAT
TTCATCTTCC TCGGTGATCC GAATCGCCAA GGTACAGCCT GCTCGGCAAC ACCACCGCCG
CCGGAGAATA CCAGTGTTGT GCGCCTGACC GGGAGTGTTG AGCTGTGGCC GGAGTATCAG
ATTGTCGTGA ATGAGCGTCG TCCGGTGCAA TATGTGGTCA TTCTTGATGT CTCCGGTTCA
ATGAATGCCA ACTTCATCGG CCAAGGTATC GTGAATGGGC GGGTGACCCA ATGCACAAAT
GGGCCGCCCG GATCGCCGCC AGCTCAAAGC TGTGGTCAGC CCCAATATGC ATGGAACCCG
GTGCAAGAGC GGCGCATTTA TGTCGCCAAA AAGGCGCTTG AGTTGCTGAT CCGGCAAACG
AATATGCCCG GTAACCCCGG TTACGATCCA ACCCAGCCAA TTGACAGTAT GGCGCTGGTC
TGGTTTACGC ATAACGTTCC GAGTACGAAT GTCGTACCCT TCAAATCGAA TCCAAATGAA
CTGATTCAAG CCGTTAATAG TGCCGGTGCA TATCAGGGTG ATCCATACAA AACAAGTGGT
GGTACTAACG GTACCGGTGG ATTGTACCGT GCCAGTCAAT TGTTGGCTAA TGCACCTAGA
ACCACCAACC AACTTGGTAA GGAGTGGATC TATCGGCGGG CCATTATCTT CGTAACCGAC
GGCGTGACGA ACACCTTCTT TAATGCCAAT AACTCGAATG TGAATGGTGG AAGTAGTAAC
CAGACTACGT ACCCCACAGG CCATGTTTGT CGGAAGGCTG AGGTGCTCGA AGATGCACTG
TGTCAGACAA CCGAGGTCGG TGGTAAGTAC AATGGGATGG ATCGACCGAT TACGCAAATG
GTGAACATGA CCAACACTAT CAAGTCTAAC CAGTCGATCC AGACCGATAT TTACGCACTA
GCACTCTCGT CGATCCCGGC GACCGGTCTA CGTGATGGCG TTGCTAGTAC ACCGCGCCAC
TTCTATACTG CGGAAACGCT TGAACCGGGT CCTGATGGGT TGAATAACGT CGACCGCATC
ATGCTAGCAA TCAATGCTGA GATTGAGCGT GGACCGTGTA TGAGCGGTAG TGATGGTGAG
TGGCGCGCTA CTATTCCGGG TAATCACTTC CAGTCGGTGG GTGGACTGAG CTATCCGAAT
GTCGGTGAGG TGATCTTACA GGATATCTCG ACCAATAGCA TCTATCGGGC ACCGATTGTG
GCCGGTACTG ATGGTCGGGT ACGGTACACC TTTGAGGAGA TTCCACGCGG TACCTATCGG
ATGCAAGCTT ATCTCTTCTA CCGCCACCCA CTTGATCCGC CAACAGCGGC ACCGCGGATG
TATAGCCAAA TCTTTGCCAA TGGCTCGACC CAATCGGATA TGGTGGTGGT GCTTGAACCG
AATGGTCAGG GAGCTGGTTT CATCTCAACG ATTGAGCAGA ACCTACGCTT GCGCCTCGAC
GGTAATGTGT GCGCAGTGAA CTGA
 
Protein sequence
MLDRRIRTYR LSRRTRGQSI PLLALMIVVL IGMVALSVDV GRTFSEERRA VAAANAASLS 
AMNTYVRRPA GTTNKVIYDS IVNSLRSNGI DIENNPNIRM EAYYLNGRGE PIEGGARINP
DGTVAPDNVA YIQVNLEGDV DTFFARVVNQ NQLPIGATAY AGTCPPTDGV YPIAVNNEYI
SGNEFRNPGD ANGDGKPDNN WQKLTSGPYK GFTKMRLYPT DGNLPGQFGW LRWLDGRGAS
GANANSNQEL ELALTGTGSL SKGFMEVVPW PATNLPRPAS YPERPGELNV GDWVYGSSGY
NNSVGVRNAL DAHIAAGTRM VLPIYDIAVG QGSNAAFRVV RFGLFVLTAY GQERGKPYLD
FIFLGDPNRQ GTACSATPPP PENTSVVRLT GSVELWPEYQ IVVNERRPVQ YVVILDVSGS
MNANFIGQGI VNGRVTQCTN GPPGSPPAQS CGQPQYAWNP VQERRIYVAK KALELLIRQT
NMPGNPGYDP TQPIDSMALV WFTHNVPSTN VVPFKSNPNE LIQAVNSAGA YQGDPYKTSG
GTNGTGGLYR ASQLLANAPR TTNQLGKEWI YRRAIIFVTD GVTNTFFNAN NSNVNGGSSN
QTTYPTGHVC RKAEVLEDAL CQTTEVGGKY NGMDRPITQM VNMTNTIKSN QSIQTDIYAL
ALSSIPATGL RDGVASTPRH FYTAETLEPG PDGLNNVDRI MLAINAEIER GPCMSGSDGE
WRATIPGNHF QSVGGLSYPN VGEVILQDIS TNSIYRAPIV AGTDGRVRYT FEEIPRGTYR
MQAYLFYRHP LDPPTAAPRM YSQIFANGST QSDMVVVLEP NGQGAGFIST IEQNLRLRLD
GNVCAVN