Gene Cagg_0723 details


Gene Information

Locus tag: Cagg_0723
Symbol:
ID: 7266975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 901446
End bp: 902705
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 57%
IMG OID: 643565574
Product: von Willebrand factor type A
Protein accession: YP_002462083
Protein GI: 219847650
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
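
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 901446
END_BP = 902705
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print("gene length matches record:", computed_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, 902705 - 901446 + 1 gives 1260 bp, and 1260 / 3 - 1 gives 419 aa, which agrees with the listed values.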


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0959383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGAG AAGTTTCTCT GCGCGCCGTG TTAGCGCGTC CCTTTTTAGC AGCCACAACT 
ACGCCACAAG TGGCCTACGT GCTGCTCGAA GCGCAGCCGG CGCCACAGAT GACGCAGGTG
CGAATGCCGG TCAATGTCTG TTTTGTGCTC GACCGGAGCG GTTCGATGAA GGGTGAGAAG
ATCGAGCGAT TGCGACAAGC AGTGGTGAAG GCGATTGAGC TGCTCGATCA GCAGGACTCG
CTTGCGATTG TGATTTTCGA TCATCGTACC GAAGTATTGG TTCCGGCTCA GCCGGTGCGT
AACCGCGCGA TGATCCTCGA TCTCGTTCAC CGTATTCGTG ATGCCGGTGG GACGCGGATT
GCACCTGCGG TCGAAAAAGG GTTGCAGGAG TTGCAGAAGA TGCCGCCGGG TGTACGCCGT
CTCATATTAC TTACCGACGG CCAAACCGAG CACGAGAACG AGTGTTTGCT GCGGGCCGAC
GATGCCGGGC GGCTTGGTGT GCCGATTACT GCCCTTGGCA TCGGCAAAGA CTGGAACGAA
GATCTCTTGA TCGAGATGGC GAATCGCTCG AAGGGAGTGG CCGATTATAT TGCCCAACCC
GGCGAGATTG TAAACTATTT TCAACATACC GTGCAGCGTG CCCAACAGAC CGTCATTCAG
AACAGTGTGC TTACCTTGCG GTTCGTGCAG GGCGTAACGC CGCGCGCCGT CTGGCAAGTA
ACACCGCTGA TCGACAATCT CGGCTACCAA CCGATCGGTG ACCGGGCGGT GAGTGTGAAA
CTCGGCGAAC TCGAAGGTTC GCAACCCCGG ATACTCCTGA TCGAACTGTT GATCGATCCC
CGTCCGCTCG GCACCTACCG GATCGGGCAG GCCGAGTTGA GCTACGACGC GCCGGCGTTG
CAGTTGGTGG GAGAAAAAGC CAAGTTGGAT ATTATGCTGA CCTTTACCAA TGATCCGGCT
CAACTGCAAC AAGTGAATCC GACGGTGATG AACATCGTCG AAAAGGTGAG TGCGTTCAAG
CTCCAAACCC GCGCCTTGCA AGACCTTGCC GCCGGCGATG TGAGTGGCGC AACCCAGAAG
CTGAAGAGCG CGGTGACGCG CTTACTCAAT CAGGGAGAAG TGGAGTTAGC GGCGACGATG
CAGCAAGAGA TTGCCAACCT CGAACAACAA GGCCAAATGT CGAGCGAAGG TCAAAAGACG
ATCAAGTTTC AGGGCCGCAA GACGGTGCGC TTAACCGACA TCGAATTGCC GAAGGAGTGA
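
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of that cleanup together with a simple GC percentage, which could be used to compare against the GC content field in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is pasted in to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGGCTGGAG AAGTTTCTCT GCGCGCCGTG TTAGCGCGTC CCTTTTTAGC AGCCACAACT"
seq = clean_sequence(first_line)

print(len(seq))                   # number of bases after cleanup
print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

To reproduce the record-level figure, the whole gene sequence would be passed through `clean_sequence` first. The value for a single line will generally differ from the value for the full gene.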
 
Protein sequence
MAGEVSLRAV LARPFLAATT TPQVAYVLLE AQPAPQMTQV RMPVNVCFVL DRSGSMKGEK 
IERLRQAVVK AIELLDQQDS LAIVIFDHRT EVLVPAQPVR NRAMILDLVH RIRDAGGTRI
APAVEKGLQE LQKMPPGVRR LILLTDGQTE HENECLLRAD DAGRLGVPIT ALGIGKDWNE
DLLIEMANRS KGVADYIAQP GEIVNYFQHT VQRAQQTVIQ NSVLTLRFVQ GVTPRAVWQV
TPLIDNLGYQ PIGDRAVSVK LGELEGSQPR ILLIELLIDP RPLGTYRIGQ AELSYDAPAL
QLVGEKAKLD IMLTFTNDPA QLQQVNPTVM NIVEKVSAFK LQTRALQDLA AGDVSGATQK
LKSAVTRLLN QGEVELAATM QQEIANLEQQ GQMSSEGQKT IKFQGRKTVR LTDIELPKE
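
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a hedged illustration, the sketch below builds the codon table by hand and translates the first 30 and the last 9 bases of the gene. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, which this sketch does not model. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string codon by codon; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing incomplete codon is ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases and last 9 bases of the gene sequence in this record.
gene_start = "ATGGCTGGAGAAGTTTCTCTGCGCGCCGTG"
gene_end = "AAGGAGTGA"

print(translate(gene_start))  # MAGEVSLRAV
print(translate(gene_end))    # KE*
```

The first ten residues and the final two residues agree with the protein sequence listed above. The trailing `*` is the TGA stop codon, which is not part of the 419 aa protein.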