Gene Cagg_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0299 
Symbol 
ID7267480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp374592 
End bp376436 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content59% 
IMG OID643565167 
Productvon Willebrand factor type A 
Protein accessionYP_002461681 
Protein GI219847248 
COG category 
COG ID 
TIGRFAM ID[TIGR02226] N-terminal double-transmembrane domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGC TAGCACCATT CGGATTACTC GCCCTCTTGA CCCTCCCGGT GATTGTTGTG 
CTCCATATGG TGCGCTCACG CCGGCGGCGG GTCGTTGTCC CGTCACTGCT GTTATGGCAG
CAGATCCCGT TACGTCCCTC GACGAGACGT CGGCGACTAC GACTAACGCT GCTCTTGCTC
CTCCATCTCA TTGCCGCCTT GCTGATTGGG TTGGCGTTGG CCCAGCCGCA AATCGTGTTG
CCGTGGTTTG GTCCGCATCG GTCAATTGCG ATCATTATCG ATACCTCGAC CAGCATGGCA
CTCGCCGAAG GTGGAACGAC GCGGCTAGAA CGAGCACGGC AACGCGCAGC AACGGTGATT
GAACAGATGG GATGGCAAGA TCAGGTAGTC TTGATTAGCG CCGGCCCAAG TGCTCGCCTC
ATCGACTATG GGAGGAGCGC CGAGCGGTCG CGTTTACTGG CTGCGCTCGC GGGAGTGGGT
GTCGAGGGTG CCGGTAGTGA TGTTAGCGGT GCCCTTACCA TTGCCGAGAC GTTATTGCTC
GATCAGCCCG GTGCGCGGGT GTTGTGGTTG ACCGACGGCG CGTTACCACC ACCCAACGCG
ATGGACCTAC ACCTTCCACT CCAGATCGAA GTTCTCGGCA CAGCCCAACC TAACCGGGCG
GTTGTCGCGT TGGCAGCCCG CCGAGGCTCT AGCGGGGCGG TCTATCTCTA TGCACGGCTG
GCCAATTACG GGACACAATC GTTTCGGGGG CCGGTCCGCC TCCTCGTTGA TGGGCAGCTT
TCTCAAGCCG AGACGGTAAG TATTCGCCCC AACGCAGTTG TTGAGCTGAC GTGGACCTTA
CCCGGCTCGA TTCAATACAT CCAACTCGAG TTCGCCGGCA ACGATGGCTT ACCGCTTGAC
GACACTGCTA CGGTGAATGT GGATGGTCAA CGCCCTGCCA GAGTCACACT GGTAGCCCAA
GCGGCATCGG CACTAGTACG GGTGTTACAG GCGATGCCGG ACGTAACCCT CACCGTCGTT
GAACCGGCGA CGTATACCCC TGATCCGACC ACCGATGTGA CGATATTTGT CAACACTGTG
CCGACCGAAT GGCCGGACGG CGGCGTACTC GTGATTAACC CGCCTGACAG CGGCTTACTG
CCGAACACAG GAACACGCCT CGCCGAACCG ATCGTTACCC TGACCCCTGC CGGTGAGCCG
CTGTTGCGTG ATGTTAATCT GAGTGGCATT GCCTGGGGAC GAATCGGCAT GTTGGAACTG
CCGGATTGGC TGACCCCGTT GGCTTTGAGC GGTGATACCC CGTTACTGGC GCGCGGACGT
TTTGAGCGTA GCGAGATTGC GGTCTGGAAC TTTGATTTGA ATAATAGCCC GCTGGTCGGT
CGGTTAGCGT TTCCTCTGCT CGTGGTGCGG ACGGTACGAG ATTTGATGCC GCCATCCCTC
CCATCATCAC TAACCCTCGG CACACCACTG ACCTACAAGG CCGATCTACG CGCTACCCAT
TTGGACGTGC AAGCCCCAGA TGGGAGTTGG CAGACGTATC CGTTGCAACC GGCTCTCCCG
ATCATTATTG AACCGACGCA AGCCGGTCTT TATCACCTAT CCGAATGGGC CGGCACACAA
CTGCTCTCCG CGATGACCAT TCCGGTCAAC GCCGGCGCAA TCGGCGAAGC TGATCCGACT
CCACGCCTGA CCAGTACAAC GCTTGGCCCG GTCGCAACAA CACCAACCAC GCAGCCGGTA
ACGGTGCCAC AACCGTTGTG GTCGTGGCTG CTTATCGCGA CCATTATTGT TCTCGTGGTC
GAATGGTTCT ACGTGCAGCG CCGGCCATCG GTGGAGGCAA GATGA
 
Protein sequence
MSLLAPFGLL ALLTLPVIVV LHMVRSRRRR VVVPSLLLWQ QIPLRPSTRR RRLRLTLLLL 
LHLIAALLIG LALAQPQIVL PWFGPHRSIA IIIDTSTSMA LAEGGTTRLE RARQRAATVI
EQMGWQDQVV LISAGPSARL IDYGRSAERS RLLAALAGVG VEGAGSDVSG ALTIAETLLL
DQPGARVLWL TDGALPPPNA MDLHLPLQIE VLGTAQPNRA VVALAARRGS SGAVYLYARL
ANYGTQSFRG PVRLLVDGQL SQAETVSIRP NAVVELTWTL PGSIQYIQLE FAGNDGLPLD
DTATVNVDGQ RPARVTLVAQ AASALVRVLQ AMPDVTLTVV EPATYTPDPT TDVTIFVNTV
PTEWPDGGVL VINPPDSGLL PNTGTRLAEP IVTLTPAGEP LLRDVNLSGI AWGRIGMLEL
PDWLTPLALS GDTPLLARGR FERSEIAVWN FDLNNSPLVG RLAFPLLVVR TVRDLMPPSL
PSSLTLGTPL TYKADLRATH LDVQAPDGSW QTYPLQPALP IIIEPTQAGL YHLSEWAGTQ
LLSAMTIPVN AGAIGEADPT PRLTSTTLGP VATTPTTQPV TVPQPLWSWL LIATIIVLVV
EWFYVQRRPS VEAR