Gene Cagg_1673 details

Gene Information

Locus tag: Cagg_1673
Symbol:
ID: 7268975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2041779
End bp: 2043176
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 56%
IMG OID: 643566515
Product: adenylosuccinate lyase
Protein accession: YP_002463010
Protein GI: 219848577
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
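
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (a common convention for such records, but not stated here) and that the gene length includes a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2041779
end_bp = 2043176
gene_length_bp = 1398
protein_length_aa = 465

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the reported gene length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks agree with the record: 1398 bp is 466 codons, which is 465 residues plus one stop codon.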


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACG ATATGGCTCG CTTGGCTGCG CTCGGCCCGC TCGATGGCCG TTACCGTCCT 
GATGTGGCGG CACTGGCCGG CTTTTTTAGC GAGGCAGCAC TCTTTCGGTA TCGGGTCCGG
GTCGAGGTTG AGTATCTTAT TTTTTTGTCG CGTGCTCGCG GGATCAGTTT TGTGCCCCCG
CTGACTACCT CACAACAGGC TGCCCTGCGT GCGTTGTATC GTCAGTTCGG CGACGATGAT
GCGTTAGCGA TTGCCGAATG GGACCGGCGA GTAAATCACG ATGTCAAGGC GGTTGAGTAT
TGGTTGCGCG AGCGTCTAAC AGCACTTGGT CTAACATCCC ATCTGGAAGC GATCCATTTC
GCAATTACCT CCGAAGATGT CAATAATTTA GCCTACGCGC TGATGGTCAA GGAGGCGCGT
GAGCTGGTGA TGTTGCCCGC ACTTGAAGCG ATTCTCGAAC GGTTACGCCA ATTGGCCGAT
GAAGAAGCAG CGACGCCAAT GCTGGCGCGT ACCCATGGGC AGCCCGCCAC CCCAACCACG
TTCGGCAAAG AGATGAATGT GTTTTTCATG CGGTTACGAC GGGCCATCGC CGATCTGATG
GCTATCCGAA TCACCGGTAA GTTGAACGGC GCCAGTGGTG TCTTTGCCGC TCATTATGCG
GCGTTGCCGC AAGTTGACTG GTTGAAGTTT TCGCGTGCCT TTGTTCGCTC GCTTGAACTT
GAGCCGATCT TGCTGACCAC CCAGATTGAG CCACACGATA CGCTCGCCGC CCTGTGTGAT
GCGTTCAAGC GGATTGGTGC GATTCTGACC GACCTGAGCC AGGATTGCTG GCGTTATATT
AGTGATGGCT ATCTGGTGCA GGCGGCCGAT GCCGGTGAAG TCGGTTCTTC CACTATGCCG
CACAAGGTAA ACCCGATTGA CTTTGAAAAT GCCGAGGGGA ATCTCGCTGT CGCCGGCACA
TTGCTCGAAC TGTTTAGCCG TAAGTTGCCG GTGTCGCGCT TGCAACGCGA TTTGTCCGAT
AGCACTGTCT TGCGTAACCT TGGGTTGGCC TTCGGCTATT GTCTCCTCGC CTATCAACGG
TTGCTGCGCG GTCTGACGAA GGTAGCGGTA GACCGATCTC GGCTACGTCG CGACCTCGAG
GCCCATCCTG AAGTGTTGGC CGAAGCGATC CAGACCATTC TCCGCCGTGA AGGGTTTGCG
CAACCCTATG AGTTGCTGAA GGATTTTACG CGCGGACGAG CACTAACTGC CGAAGAATTG
GCTCGCTTTA TCGCGAGCTT GCCGGTGAGT GATGCTGTGC GTGCTGAGTT GCAGGCACTC
TCTCCTGTCG CGTATATCGG GTTGGCTGTA AAGCTTGCCC AACTTCGCGA TGAGGCGACC
GTCGGTAACT GGCTGTAG
 
Protein sequence
MTNDMARLAA LGPLDGRYRP DVAALAGFFS EAALFRYRVR VEVEYLIFLS RARGISFVPP 
LTTSQQAALR ALYRQFGDDD ALAIAEWDRR VNHDVKAVEY WLRERLTALG LTSHLEAIHF
AITSEDVNNL AYALMVKEAR ELVMLPALEA ILERLRQLAD EEAATPMLAR THGQPATPTT
FGKEMNVFFM RLRRAIADLM AIRITGKLNG ASGVFAAHYA ALPQVDWLKF SRAFVRSLEL
EPILLTTQIE PHDTLAALCD AFKRIGAILT DLSQDCWRYI SDGYLVQAAD AGEVGSSTMP
HKVNPIDFEN AEGNLAVAGT LLELFSRKLP VSRLQRDLSD STVLRNLGLA FGYCLLAYQR
LLRGLTKVAV DRSRLRRDLE AHPEVLAEAI QTILRREGFA QPYELLKDFT RGRALTAEEL
ARFIASLPVS DAVRAELQAL SPVAYIGLAV KLAQLRDEAT VGNWL
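
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch in plain Python that uses only the first 30 and the last 18 bases of the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are illustrative and not part of the record.

```python
from itertools import product

# Compact form of the standard codon table, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA in frame 1; a stop codon is rendered as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Fragments copied from the start and end of the gene sequence above.
first_30 = "ATGACGAACG ATATGGCTCG CTTGGCTGCG"
last_18 = "GTCGGTAACT GGCTGTAG"

print(translate(first_30))  # MTNDMARLAA
print(translate(last_18))   # VGNWL*
```

The two outputs match the first ten and last five residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content reported in the record.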