Gene Cagg_1135 details


Gene Information

Locus tag: Cagg_1135
Symbol:
ID: 7267883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1401915
End bp: 1403255
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 60%
IMG OID: 643565978
Product: von Willebrand factor type A
Protein accession: YP_002462481
Protein GI: 219848048
COG category:
COG ID:
TIGRFAM ID:
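
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1401915, 1403255)
print(length, protein_length_aa(length))  # 1341 446
```

Under these assumptions the computed values match the recorded Gene Length (1341 bp) and Protein Length (446 aa).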


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0216745
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTTGC ATGTGCAGTG TGATCCGCAA CCACTGCAAT TACCACCACT GACACAGCCA 
CAGGTGGCGT ATGTTCACCT GGTGATTAGC ACTCAGGGCG ACCGCACGTT GCCGCTCCAC
CTTGTGGTGG TCGCCGATGC CAGTCGCTCG ATGCGCATTC CCATTGTCGA CGAGCACCGG
TTTCGCGAGT TGGTGCGGAA CGGTGGCGCT CACGAGGTCT TGGTCGATGG AGTACCGGTT
TGGCAATTGG CAAATCCCCT AAGCAGTGAG GCACGCAGTC AATTTTCTAG TCCGATTGAC
TATACCGTGC GCGCATTACA CAGTGTGGTT GAACGGCTCA CTCCTGACGA CCGGATGGCA
CTCATCGCCT GCGCTAGCGA TGCGCTCGTC CTCGCCCCAA GCACACCCGG CCACCGACGC
ACCGACTTGA TCGGAGCGAT TGCTCGCTTG CCCGTGCTAC GGCTTGGCGA GAGCACCAAT
CTGGCCCAAG GGTTGCAACT AGCGCTGGCC CAGTTTGTGG TCACCGATGA ACCGGCAGTA
CGACGGGTGG TGTTACTCAC CGATGGCTTC ACAACGGATA CCACCATGTG TACCGCATTG
GCCCGCGAAG CAGCAGACCG GAGTATTACG ATCAGTACGA TCGGGCTTGG AAATACATTT
GAAGAAACGC TCCTCACGCA GATTGCCGAT CTTAGCGGTG GTCGTGCCAG TTTTGTCCAA
GAGGCCGGTC ATATTCCGAC GATTATCTCC GCCGAGTTGG AGCATGCGCG CCAGACCACT
ATCCACGCGC TGAGTCTGCA CATGACGTTA CCGCAGACGG TGACACTACG CCGGATCACC
CGCCTCTCCC CAACCTTGAG TGTACTTACG CCGTTAAGCA CCGAACATGG GCGGCGCCTG
ACCCTACACC TCGGCGATCT GCGCCGTGGC GACGCAGTGC GCTTGCTCTG CGAATTTCTC
ATCGCCCCCG GCACTGCGGG GAGCCAACGC CGGCTAGCCC GCCTGCGCCT GAGGAGTGGG
CAACACGAAC AACACCATGA CCTCATCGCG CATTACGATC CGCGCGCCAC GAACCCGCCA
CCGGCACTAT TACCACTGAT TACCCACGCT ACCATTGCCC ACCTCCACCG GCGTGCCACC
CTCGCCCGCC AGCAAGGCAA CCACGAAACT GCTGCTGTTC TGCTCCACCG GCTTGCGGCC
CATCTCCGCA GCCTCGGCGA AACTGAACTG GCCACGTTGG CCCTGCAGGA AGCGAGTACT
TCTGGGCAAA TACCACTTCC CAACTTGACA ACCAAAATGC TAACCTACGC TACCCGACGA
CTAGGAGAAG CGGGTGATTG A
 
Protein sequence
MLLHVQCDPQ PLQLPPLTQP QVAYVHLVIS TQGDRTLPLH LVVVADASRS MRIPIVDEHR 
FRELVRNGGA HEVLVDGVPV WQLANPLSSE ARSQFSSPID YTVRALHSVV ERLTPDDRMA
LIACASDALV LAPSTPGHRR TDLIGAIARL PVLRLGESTN LAQGLQLALA QFVVTDEPAV
RRVVLLTDGF TTDTTMCTAL AREAADRSIT ISTIGLGNTF EETLLTQIAD LSGGRASFVQ
EAGHIPTIIS AELEHARQTT IHALSLHMTL PQTVTLRRIT RLSPTLSVLT PLSTEHGRRL
TLHLGDLRRG DAVRLLCEFL IAPGTAGSQR RLARLRLRSG QHEQHHDLIA HYDPRATNPP
PALLPLITHA TIAHLHRRAT LARQQGNHET AAVLLHRLAA HLRSLGETEL ATLALQEAST
SGQIPLPNLT TKMLTYATRR LGEAGD
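
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted in the 10-base blocks shown here, contains only A, C, G and T, and is read in frame from the first base. Translation table 11 assigns amino acids to codons the same way as the standard code and differs only in the start codons it permits, so the standard codon table is used. The names `clean`, `translate` and `gc_percent` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * first + 4 * second + third]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, truncated to 30 bases.
print(translate("ATGCTGTTGC ATGTGCAGTG TGATCCGCAA"))  # MLLHVQCDPQ
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the listed protein sequence and the recorded GC content. This gene begins with ATG, so the alternative start codons allowed by table 11 need no special handling here.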