Gene Cagg_2145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2145 
Symbol 
ID7267653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2635714 
End bp2637075 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content56% 
IMG OID643566977 
ProductSAF domain protein 
Protein accessionYP_002463465 
Protein GI219849032 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4091] Predicted homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.342497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000671283 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTCTCG TTGACACTGC TCTCGCCCGT GCTGAAACCG AAGGACGCCC GATCCGGGTA 
GGCATGATCG GTGCCGGTTT TATGGCGCGC GGTATTGCTC TCCAGATTAT TCGCTACACC
CGTGGGATGC GTTTAGTGGC AATTGCTAAC CGTACTATTG AGCGCGCTAT CCAGGCCTAT
ACCGAAGCCG ACGTGCCTGC CGAGGCGATT CGGCGGGTTA CCTCGGCAAC TGCGCTCTCC
GAGGCACTGG CCGCCGGCGC ACCGGCGGTC ACCGATGACG CACTCCTGCT CTGCGCTGCT
GAGGGTATCG ATGTCATTCT CGAAGTCACC GGTGCGGTGG AGTTTGGAGC GCACGTGGCA
CTGGCTGCTA TGCAACACGG CAAACACGTC GTGACAATGA ATGCCGAACT CGATGGCACT
CTCGGCGCGA TTCTACAAGT CTATGCCCGT CGTTACGGCG TTATCTTTAC GCTGTCCGAC
GGTGATCAGC CCGGAGTCAC GATGAATCTC TATCGGTTTG TTCGTGGATT AGGGGTCAAA
CCGGTGCTGT GCGGTAACAT CAAGGGCTTA CACGATCCGT ACCGTAACCC AACTACCCAA
GCCAACTTTG CCCGGCAGTG GGGACAAAAT CCGTACATGG TGACGAGCTT TGCCGATGGC
ACGAAGATTT CGTTTGAACA GGCGGTCGTT GCCAATGCCA CCGGTATGCG GGTGGCACGA
CGCGGTATGT TCGGCCCGAC CGTTCCCTCA GGAACACCAC TCGCCGATGT CGTACATGAC
CTTTACCCGC TAGAGGCACT GATCGAAGGG CCGGGGATTG TCGATTATGT CGTTGGGGCG
ACACCGGGAC CGGGTGTGTT TGTGCTGGGT ACCCACGATC ATCCACGCAT GCAGCACTAT
CTCAACTTGT ACAAGTTGGG GAAGGGTCCC CTTTACCTCT TCTATACGCC GTACCATCTT
TGCCACTTTG AAGTGCCCAA TTCAATCGCG CGTGTGGCAC TGTTCGGCGA TCAGGTGTTA
GCCGCTGCCG GCCGACCAAT GGTTGAGGTT ATTACATCAG CCAAGACCGA CCTACATGCC
GGTCAGACTC TTGATGGGTT GGGCGGCTAC ATGACGTATG GCTTAGCCGA GAATGCCGAT
GTTGTTTACG CAGAACGTTT GTTGCCGATC GGATTAGCCG AAGGGTGTAC CTTGCGACGT
GATATTCCCA AAGATGGCAT CATCACTTAC GACGATGTGG AGTTACCGAC CGATCGGCTG
AGTGATCGTT TGCGTGCCGA GCAAGATGCG TTGTTTTGGG GTAAACCGGC TACTGCCGGC
GCATCCGAAG GACAATACCA ACGCATAGCG CACACAGAGT GA
 
Protein sequence
MILVDTALAR AETEGRPIRV GMIGAGFMAR GIALQIIRYT RGMRLVAIAN RTIERAIQAY 
TEADVPAEAI RRVTSATALS EALAAGAPAV TDDALLLCAA EGIDVILEVT GAVEFGAHVA
LAAMQHGKHV VTMNAELDGT LGAILQVYAR RYGVIFTLSD GDQPGVTMNL YRFVRGLGVK
PVLCGNIKGL HDPYRNPTTQ ANFARQWGQN PYMVTSFADG TKISFEQAVV ANATGMRVAR
RGMFGPTVPS GTPLADVVHD LYPLEALIEG PGIVDYVVGA TPGPGVFVLG THDHPRMQHY
LNLYKLGKGP LYLFYTPYHL CHFEVPNSIA RVALFGDQVL AAAGRPMVEV ITSAKTDLHA
GQTLDGLGGY MTYGLAENAD VVYAERLLPI GLAEGCTLRR DIPKDGIITY DDVELPTDRL
SDRLRAEQDA LFWGKPATAG ASEGQYQRIA HTE