Gene Cagg_1317 details

Gene Information

Locus tag: Cagg_1317
Symbol:
ID: 7268608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1624893
End bp: 1627946
Gene Length: 3054 bp
Protein Length: 1017 aa
Translation table: 11
GC content: 43%
IMG OID: 643566160
Product: von Willebrand factor type A
Protein accession: YP_002462661
Protein GI: 219848228
COG category:
COG ID:
TIGRFAM ID:
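
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. Neither convention is stated in the record, although both are consistent with the listed values.

```python
# Values copied from the record above.
start_bp = 1624893
end_bp = 1627946

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3054 1017
```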


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000794512
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000764537
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATCAAT CTCGATCTTT GCTTGTTTTG AGTATTGTAG TATGTATTAT CAGTATATGT 
ACAGTAGAGC CATCAATCGC TCAATCTGGA GTGGAACCGC TTGATCTTGT ATTGATTATT
GATCATTCAG GCAGTATGGA AAACCCGAAA TACGGTCGAT CAGATCCTCA TTCGATGCGT
TTTCTAGCCG CTCGTATGCT GATCGATTTG CTTAATGACG AAGATCGAGT AGGCTTGATA
TTGTTCTCGG ACAATGCAGA AGACTACTCT GATGGTCTCC AACTAGTACA GACTGGACGG
GGTCGCCTCA AGGAGAATAT TGCAAAGATG GAAAGTCAGT CTACGGGAGA TTTCACCAGA
TACAAGGATG CGTTAGAGCT TGCCGGAGAG TTGTTAGGTG AGACACCTGC AAATCGTCGT
GCAGCAGTTA TTTTCTTGAC TGATGGTGCA CCAACTGATA TGAAACAGGA GGAAGACTAC
AGTACAGCCC TCGATCTGTT CATTGATCGT AATGTGCCGG TTTTCTTGTT GATGTTAAAG
CCAAAAGAGT TTGATAACAA TGCAGTCCGT AATGATACCT TGCAACGAAT AAGTAAGACA
CTACAAATAT TTCGTGATAA CAAACAAACT GTAATCGAGA TTGATGATCC GGCAAGTATC
GCTCGTGCCT TTGCGAAGGT GATAACTGAT TTACAGCCGG GTGTCTATAT TGATGTTGAA
AATCCGCGTG GTAATCCTGA TCGCGATCAA ACGATCTTTC AAGCGAGTGT AGCTTCATCG
CAGCGTCTTG CTGATGTAAC ATTTGTTTTC TACCCAAACG AGGGCATTTT TAATCAAACG
CCTAAAATTA CTGAACAACA AAGGCCGAAT GGGGTAAAAG AGTATGACAT TCAGGAAAAT
GAAAATTACG TTACAGCTCG CTACAAAGCT GACGTTGGCA CTAATATTGC AGGTGCATGG
CAGTTTACTG CCAACGTCCC AACTGAATCG ATTAGTGCAT TCACCTTCTT TCGATCTGAT
ATTCGCATTC GTCCTCGCTA TCCCGGTTTA AAGAACCCAG CACTAATTCG TGGTCAGAAC
ACTTTGATTG GCTTCACAGT TGATGGTGTG CTGCAAACTC AGGATGTGCG ATTGCGTGTA
AATTTGCAGC AATGTCCGTT GGACAGCCGC GATCAAGCTG ACAGTCGTGT CCTACCGCTT
GAAAAGGGTC CCGATACTAC TCTGTGGACT ATTCTGAAAG ATATCGGAAA AGACTCGGAA
TACGTGTACG TCACGGTTGA GTTCGCACCG GTTAATGCGC TCTCCCTTTG GAAGTGCTTT
ACTGTTCGAG TACTACCGAA TGATCCAAAA ATGAATGTAA CCATTACTTC TGTCGAACAT
AATTCAGATG GTTCGTTCAC AGTTGAAGCT AATTTGCCAG AAGGTCATAA AAATACATCA
ATGTACGTTA CCGGACCGAG TAACTATAAT GATTTAGTGC CATTTGATGG GAACCGTGTT
CAAACGAAGC CATTACTTAC TAGCGGTGAA TACATCGTGA AAGTGATTGC TGAAGTAGAG
TATCTTGGGT TTAATGTTGC ACTAGTTGAT CAACAACCTG AAACCGTTCC ATCGTTCATC
AGTTTGAATG AGAGTGAGAG TCAATTAGCG CCAACGATAC GATGTGATGA TCGGTTGCTG
GAAGGCACCA TTGTGCTAAA TGCCCCATTC TTGCTGCAAC CCGACGAAGT TCGTTTCCAA
GTGAGTGAAA TTTCGCGTGA CGGACGGTCT GTTAACACCT TGTCCGAGAG CAGTATTGCG
CTCTGTCCAC AAGGCTTTAT GGTAGCTGAT AAGAAGGTTC GTTGTCCCTT TCAAATCAAT
CTACCAGACG GGATGGAAGC AGGTCAGTAT CGCGTGAATA TAAAAGTGGA CTCAGACAAG
CACAAGGTAA TGTTCCAGAG TATCCCTATT ACTTTCACTG TACCGCAGAC TGGAATCAAC
ATATCAAATA CGGCACTCGA TTTTGATGGA CCAATCACAC CTCTTCGGTC AGCAGCGCGA
ACGCATGTGG TGCTAACCGG ATGTCGGATT GAAAATTGGC CGTCAATCAC ATTTATCGTT
GAGTCGGTGC GAGGGTCGAA CCAAACCTAT CCTAGTAGTT ATGCATCTCT TCAACCGGAA
CCACCATCAC CAGACGGCGA TACTCGCCGT TATCAAGTCG ATGTTAGATT TGATTCATCA
TTACCTCCTG GTGAATATGA AGTTAGTGTG CAGTTGAAAA CTGACCCGCC AGTAATGATC
AACCCTTCAC CGAGAATAGT GGTGAGGGGC GAAAAAAAGA GTACAGTGGT TGAGTTGATT
GCTCCACCAA GCGATACGTT TTCGGCGATT TGGGGATTTT GGCTGCCATT TTGGCAACCT
ACGTTAACCA TTCCGCTTAC TGCAACTGTG AGCTTCGATG ATAAGATACC GGTCACTTAT
CAACCACCTA GGGTGGAATT GGTTAAGAAA GATACTGAGC GTTACTCGCC AGATACGTTT
GTTACACAGT GGATAGATGA AACACGAATC AGCAAGAATA CATTCATTAA GACAGTTGAG
CTTCGTCTTA ACAGATGGTT ATTTTTCAAT AGTGGTACTT ACACAGTGAC GCTAACAAGC
GATCCTGTCT TCAGTAAAGA TAATGAGCCA AAACAATTAG TAGTTCAGGT TCAGGTGTAT
GGATGGTGGG AATATCTGTG GAGAGGAATA GTACCGGCAA TTATTGTTTT AGGGATATTT
TTTCTTGTAG TTTTCCTTGT GTTGTATATC TTTCCTGTCC AGCACGGAAA AGTTATTGTT
GATGGTAGAG AGGTGAAGAA TCTTAGAAAA TTCAACGAGC TACGTATCAG GGCGTTACCT
GGTCGCAAAA TTAAAGTATT ATCTCCATCA GCAAAAAAAG TACTTCGTCT CGGTGAGAAA
CAAGTAATTG ATAGTAAATC TGTTGAGTAT AAACCATACA CTAGTGGACT GGCCCAATTT
TTTCTCTGGC TATTGTTGGT CTTAATGGTC TCTCCTATAT TCGCTTTGAG CTAG
 
Protein sequence
MHQSRSLLVL SIVVCIISIC TVEPSIAQSG VEPLDLVLII DHSGSMENPK YGRSDPHSMR 
FLAARMLIDL LNDEDRVGLI LFSDNAEDYS DGLQLVQTGR GRLKENIAKM ESQSTGDFTR
YKDALELAGE LLGETPANRR AAVIFLTDGA PTDMKQEEDY STALDLFIDR NVPVFLLMLK
PKEFDNNAVR NDTLQRISKT LQIFRDNKQT VIEIDDPASI ARAFAKVITD LQPGVYIDVE
NPRGNPDRDQ TIFQASVASS QRLADVTFVF YPNEGIFNQT PKITEQQRPN GVKEYDIQEN
ENYVTARYKA DVGTNIAGAW QFTANVPTES ISAFTFFRSD IRIRPRYPGL KNPALIRGQN
TLIGFTVDGV LQTQDVRLRV NLQQCPLDSR DQADSRVLPL EKGPDTTLWT ILKDIGKDSE
YVYVTVEFAP VNALSLWKCF TVRVLPNDPK MNVTITSVEH NSDGSFTVEA NLPEGHKNTS
MYVTGPSNYN DLVPFDGNRV QTKPLLTSGE YIVKVIAEVE YLGFNVALVD QQPETVPSFI
SLNESESQLA PTIRCDDRLL EGTIVLNAPF LLQPDEVRFQ VSEISRDGRS VNTLSESSIA
LCPQGFMVAD KKVRCPFQIN LPDGMEAGQY RVNIKVDSDK HKVMFQSIPI TFTVPQTGIN
ISNTALDFDG PITPLRSAAR THVVLTGCRI ENWPSITFIV ESVRGSNQTY PSSYASLQPE
PPSPDGDTRR YQVDVRFDSS LPPGEYEVSV QLKTDPPVMI NPSPRIVVRG EKKSTVVELI
APPSDTFSAI WGFWLPFWQP TLTIPLTATV SFDDKIPVTY QPPRVELVKK DTERYSPDTF
VTQWIDETRI SKNTFIKTVE LRLNRWLFFN SGTYTVTLTS DPVFSKDNEP KQLVVQVQVY
GWWEYLWRGI VPAIIVLGIF FLVVFLVLYI FPVQHGKVIV DGREVKNLRK FNELRIRALP
GRKIKVLSPS AKKVLRLGEK QVIDSKSVEY KPYTSGLAQF FLWLLLVLMV SPIFALS
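
The gene and protein sequences above are linked by translation table 11. The following is a minimal sketch of that translation, not the method the database itself uses. The `translate` helper and the `CODON_TABLE` name are hypothetical. The amino acid string is the standard assignment shared by NCBI tables 1 and 11, listed in TCAG codon order. Table 11's alternative start codons are ignored here, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCATCAAT CTCGATCTTT GCTTGTTTTG AGTATTGTAG TATGTATTAT CAGTATATGT"
print(translate(first_line))  # MHQSRSLLVLSIVVCIISIC
```

The printed residues match the first two blocks of the protein sequence (MHQSRSLLVL SIVVCIISIC).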