Gene Cagg_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2191 
Symbol 
ID7266764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2685012 
End bp2686652 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content55% 
IMG OID643567022 
Productvon Willebrand factor type A 
Protein accessionYP_002463510 
Protein GI219849077 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACAC GACCGCTGAA ACTGCTTACG CTACTGACCC TACTGATCAT CCTGCTCAGC 
GGCTGTGGTA ATCTTAACGG ATTACTCAGC GGAAGTGACG CCAATGCAAT CACGATTACC
ATCGCCTATA GCCCGGAAAA GGACAAATGG CTGACCGAGC AGATTTCCCG TTTCAACGAA
CAGCGGCTCA CCGTAAACAA CCGCCGAGTG CGCGTCGAAG GGATTAACAA GTCTTCGGGC
GCAGCCCGCA CCGAGATTAA GAACGGCACG CTCCAAGTAA CGGTATGGAG TCCATCGGCC
AGTACGTGGT TGGAAGTACT GAAGCAAGAG ACCGGTAATC AGAATGTGGC CGTTTCCAAC
AAGCCGCTCG TGCTCACCCC GGTCGTAATC GCAATGTGGC AGCCAATGGC CGAGGCGATG
GGCTGGCCGA ACAAGCCGAT CGGCTGGAAG GATATTCTCG ACCTAACCAA CGACCCGCAG
GGCTGGGGCC GCTTCGGTCA TCCAGAATGG GGCCGTTTCT CGTGGGGCCA TACCGACCCC
GAAATCAGTA CAACCGCGTT GAGTACCCTC ATTGCCGAGT TCTACGCTGC AACCGGCAAA
CGTGAAGGAT TAACCATCGC CGACGTCCAG AGCGAGGAAG CAACCCGCTT CATCCGTGAC
TTAGGGCGCA GTATTAAGCA CTACGGCTAC AATACGTTGA TCTTCAGCGA AAACATGAAG
AAGTTTGGTA TGAGCTACAT CTCGGCTTTC CCAATGGAGG AGATCACGCT CATCGACTTC
AACAAGTTTA ATCCACCACT AACACCGCTG GTCGCCATCT ATCCGGCAGA AGGTACTTTC
TGGCACGACA ATCCCTTCAT CATTATGGCA AGTGCCAATT CCGACGAACG TGACGCTGCC
GAGCGGTTCT ACGAATTCCT GCTCAGCGAA GAGAGCCAGC GGGCCGCGAT GTCGTTCGGC
TTTCGTCCTG CCAACCCCAA CGTGCCGTTG ACCGATCCGA TCAGCCCGGC GTTTGGTGTT
GATCCGCAAG GCGTGCAGAC CGTCTTAGCT GTGCCGACGG CAGAGGTGAT CGTCGCGATC
AAGAACTCGT GGTCACTCAA CCGCAAGCGG GCCGACATTG TGCTGGTAGT TGACACGTCT
GGCTCGATGG AAGGCGATAA GCTCACTATG GTCAAGGCCG GGATCGAGAC CTTTCTGATG
CGGATTTTGC CTGAAGATCG CCTCGGTCTG ATCACCTTCG CCTCAGCAGC CCGATTAGTG
GTTCCGATGG CACCACTCAG CGATAATCGG ATCGCCTTGC AAGATGCGGT GCAAGCGATG
CGTGCCAGCG GGCGTACTGC ATTGTTCGAT GCATTGGTAC TTGGGAAACA GGTTCTTGAA
CAGTTACCAC CGGCAGACGA TGATCGCATT CGCGCGATTG TGTTGCTCTC GGATGGTGCT
GATAATTCAA GTCAGGCATC ACTCGACCAA ATCCGCACCT TGTTCGACGA GAGCGGGATC
AGCATCTTCC CGGTAGCCTA CGGGAACGAC GCCGACCGAC AGGTGCTTGA TGCGATTGCC
GAATTTTCGC GCACGATTGT CGTGGTCGGC GATACCGGTG ATATTGCCCA GATTTTTGAA
AATCTGAGCA GGTATTTCTA A
 
Protein sequence
MITRPLKLLT LLTLLIILLS GCGNLNGLLS GSDANAITIT IAYSPEKDKW LTEQISRFNE 
QRLTVNNRRV RVEGINKSSG AARTEIKNGT LQVTVWSPSA STWLEVLKQE TGNQNVAVSN
KPLVLTPVVI AMWQPMAEAM GWPNKPIGWK DILDLTNDPQ GWGRFGHPEW GRFSWGHTDP
EISTTALSTL IAEFYAATGK REGLTIADVQ SEEATRFIRD LGRSIKHYGY NTLIFSENMK
KFGMSYISAF PMEEITLIDF NKFNPPLTPL VAIYPAEGTF WHDNPFIIMA SANSDERDAA
ERFYEFLLSE ESQRAAMSFG FRPANPNVPL TDPISPAFGV DPQGVQTVLA VPTAEVIVAI
KNSWSLNRKR ADIVLVVDTS GSMEGDKLTM VKAGIETFLM RILPEDRLGL ITFASAARLV
VPMAPLSDNR IALQDAVQAM RASGRTALFD ALVLGKQVLE QLPPADDDRI RAIVLLSDGA
DNSSQASLDQ IRTLFDESGI SIFPVAYGND ADRQVLDAIA EFSRTIVVVG DTGDIAQIFE
NLSRYF