Gene Cagg_0755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0755 
Symbol 
ID7268074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp935526 
End bp936782 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content56% 
IMG OID643565606 
Productvon Willebrand factor type A 
Protein accessionYP_002462115 
Protein GI219847682 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.802956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCAT CAGTCACGTT GCGCTGCCAA TGGGGACGCA CGCCTGTGCC CACAAGTAGC 
ACGCCACAAG TTGTCTATCT GTTGGTGGAA GCGGTTGCTC CTGCTTCACC AACTTCAGCG
TTGCCACTCA ATCTCTGTTT TGTCCTCGAC CGTTCAGGGT CAATGCAAGG TGCGAAACTT
GAGAGCATGA AGGCAGCAAC CCGCCGGGTG ATTGAATTAT TGCGTCCGCA CGACGTAGCA
GCTATCGTCA TCTTTGACGA TACGGTCCAA ACCCTCATAC CGGCGACTCC GGTTGGTGAT
CGGTCGGCAC TGCTCGCAGC AGTTGAGACC ATTACCGAAG CCGGTGGGAC GGCAATGTCG
CTCGGGATGC AAGCGGCGCA AACCGAACTC CAAAAACACC TTGGACCTGA TCGGATCAGC
CGGATGCTGT TGCTGACCGA TGGGCAGACG TGGGGTGATG AGCCAATCTG TCGTGATCTG
GCCCGCACCC TTGGGCAAGC AGGTGTGCGC ATTACCGCAT TGGGACTAGG CACAGAATGG
AATGAGCAGT TACTCGACGA TATTGCTGCG GCGAGCGATG GGTATTCCGA TTATATTGCC
GATCCGGCAC AGATTGAGAC GTTTTTTCAG CAGGCAGTGA AAGAAGCACA GGCTGTCGTT
GCTACCGATG CACGGCTGCT CCTCCGGCTT GTCCGTGACG TGACGCCGCG TGCCATTTAT
CGCGTCAAGC CGGTGATTGC GAACCTCGGT TACCAACCCA TCGGCGATGC AGCAGTTGCG
GTGCGGCTAG GCGATTTAGT CGGTGGGCAA CCGGCAGCCG TCTTACTCGA CCTGATGCTT
CCTCCACGCA CGCGAGGCCG GTTTCGGATT GCGCAGGCTG AGTTACATTT GACACCGGTT
GATCAACGGA GTGAAACGGT GATCAAACAA GATATCTTGC TCGATGTCGC CGATCAGGCT
GGGCCAGAGA GTTATGTTCC CGATGTCATG AATCTAGTCG AGAGGGTAAC GGCGTTTAAG
TTGCAGACTC GCGCCTTAAG TGAAGCAGCA AGTGGGAATA CGGCGGGTGC AACCCAAAAA
CTCCGTGCAG CCGCAACTCG CTTGCTCGAT CTAGGTGAAC TAGAGCTTGC CGCGAAGATG
AATCAACAAG CGGCAACGCT CGAACAGGGT CAACCGCTCG ATCCGGCTAC CCAAAAAGAG
TTGCGTTATG CTACGCGACG ACTGACCCAG CGACTAGAGA AAAACGAACA GGCATAG
 
Protein sequence
MSASVTLRCQ WGRTPVPTSS TPQVVYLLVE AVAPASPTSA LPLNLCFVLD RSGSMQGAKL 
ESMKAATRRV IELLRPHDVA AIVIFDDTVQ TLIPATPVGD RSALLAAVET ITEAGGTAMS
LGMQAAQTEL QKHLGPDRIS RMLLLTDGQT WGDEPICRDL ARTLGQAGVR ITALGLGTEW
NEQLLDDIAA ASDGYSDYIA DPAQIETFFQ QAVKEAQAVV ATDARLLLRL VRDVTPRAIY
RVKPVIANLG YQPIGDAAVA VRLGDLVGGQ PAAVLLDLML PPRTRGRFRI AQAELHLTPV
DQRSETVIKQ DILLDVADQA GPESYVPDVM NLVERVTAFK LQTRALSEAA SGNTAGATQK
LRAAATRLLD LGELELAAKM NQQAATLEQG QPLDPATQKE LRYATRRLTQ RLEKNEQA