Gene Cagg_3732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3732 
Symbol 
ID7267805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4545926 
End bp4547182 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content56% 
IMG OID643568539 
Productvon Willebrand factor type A 
Protein accessionYP_002465004 
Protein GI219850571 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.285175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.068193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG TTGAACTGCG GATAACGCCT AGTCGCAGTG TGTTACCGGC CCTTAACGAG 
CCACAGCTCT TGTATGCGCT GATCGAGTTG TCGGCACAGA GCGGTGCAAC GAAGATGCCC
CGTCTGCCGC TGAACTTGTG TTTGGTGATC GACCGCAGCT CGTCGATGCG TGGTGAGCGT
TTGCAACAGG TGAAGCAGGC CGCGATGCAG ATTCTCGACC TGCTTGGTGA TAACGAGAGT
TTTGCATTAG TCACGTTCAA TGACCGGGCC GAAGTGGTGG TATCGTCCCA ACTGGCACGG
GCACGGGCTG AAATTAAACG CCAAATTAGC GCAATCGAAG CTGCCGGCGG TACCGAAATG
GCAACCGGTT TGGCGCTTGG TGTGCAAGAA CTGCAACGGG CGATGATGCC GCGGGCGATC
CATCGCTTAC TGTTGCTGAC CGATGGCCGT ACTTACGGTG ATGAGAGCCG TTGTGTCGAG
ATTGCGCGGC GTGCCCAAGC GCGTGGGATT GGGATTACGG CGTTAGGCAT CGGTAGTGAG
TGGAATGAAG ACTTGCTGGA AACGATCGCC GCGCGAGAGA ATAGTCGCAC GCACTATATT
ACGTCTGCCG CCGACATCAC CAAGATTTTT ACCGCCGAAG TTGAGCGTAT GCACAGTATT
TTCGCCCAAG ATGTGCAGGT GCGACTAGCC TTACCGCCGC AGGCCCTCGT CCGTTCGTTC
GACCGGGTAC GTCCTTTCAT CGGGCCATTA CCGGTGATGG AAGAGGCTGA TTCGGTCTGG
ACGGCCACAC TCGGTGATTG GCCTGAGCAG GACGTACAAG CTTTTTTGGT TGAAGTGGTG
ATACCGTCGT TGCCCGAAGG TCGTCATACG CTGATCCGAT TCAATCTGCG TTTTCGCATA
CCCGGAAGTG ATAATGCGGT GCAGAGCTAT GACCAGGTGT TGCAGGCTGT AGTTCGCGAT
CCGGCTGAGG TAAATGCTGA TGTTGATCCG ACGGTCAAGC ATTGGCTGGA ACGGTTGGTC
GCCTATCGGT TGCAGGCCAG TGCATGGCAA GCGGTTGAGG AAGGAAAACT AGAAGAGGCA
ACTCGGCGGT TACAAATGGC CGGTACGCGC CTATTTGAAG CGGGTCAGGT TGAACTGGCG
CGTGCCGTTC AAGAGGAAGC AACTCGCCTG CTCCGCTCCG GTCAAGCGAG TGCCGAGGGT
CGCAAACGGA TCAAGTATGG TACGCGCGGC TTGATCGGGC GTGAAGAGCA GTCATAA
 
Protein sequence
MSKVELRITP SRSVLPALNE PQLLYALIEL SAQSGATKMP RLPLNLCLVI DRSSSMRGER 
LQQVKQAAMQ ILDLLGDNES FALVTFNDRA EVVVSSQLAR ARAEIKRQIS AIEAAGGTEM
ATGLALGVQE LQRAMMPRAI HRLLLLTDGR TYGDESRCVE IARRAQARGI GITALGIGSE
WNEDLLETIA ARENSRTHYI TSAADITKIF TAEVERMHSI FAQDVQVRLA LPPQALVRSF
DRVRPFIGPL PVMEEADSVW TATLGDWPEQ DVQAFLVEVV IPSLPEGRHT LIRFNLRFRI
PGSDNAVQSY DQVLQAVVRD PAEVNADVDP TVKHWLERLV AYRLQASAWQ AVEEGKLEEA
TRRLQMAGTR LFEAGQVELA RAVQEEATRL LRSGQASAEG RKRIKYGTRG LIGREEQS