Gene Cagg_3090 details


Gene Information

Locus tag: Cagg_3090
Symbol:
ID: 7269507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3748995
End bp: 3750932
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 59%
IMG OID: 643567910
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_002464384
Protein GI: 219849951
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
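
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3748995, 3750932)
print(length, protein_length_aa(length))  # prints: 1938 645
```

Both values agree with the Gene Length and Protein Length fields reported for Cagg_3090.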


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0547803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000201646
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCAACCA ACCTCAAGAT TGCGTCGATT CCCCAGATCG AGATCGATCC GGTGACGCTC 
GATATTATCG AGAATGCGCT CAAGAATACC CGCTACGAAA TGGACGCCGT CCTCTACCGC
ACGGCGATGA GTCCGGTGAT CCGCGAACAG CACGATCAGT TTCCGATGAT TACCGACCCG
CACGGGCGGA TGATCGTCGG CCAGTTCGGT TCGTATATCG CCGGTCTCCT TGACCATTGG
GATCAGACCA TCGAACCCGG TGACGTGATT TTGCTCTCCG ATCCGTACCT CTGTGGCGGC
GCGATCAGCC ACATCAACGA CCTACTGGTA ATGTTGCCGA TCTTTTACGG CGAGGGTGAA
GATCGCGAAC TGATCGGCTG GGCCAGTATG TTCGGCCATG CCCAAGATGT GGGTGGTCCA
CTGCCCGGAT CGCTGCCAAC CAACGCCACC ACCATCTTCG GCGAAGGGTT GCGGCTGCCA
CCGGTTAAGA TTTACGAACG CGGCAAACTC AACCGAGCCG TGCTCGACAT CATGCTTAAC
AACGTCCGCC AGCCCGAAAT GAACCGCGCC GACCTGATGG CGATCATCGC CAGTTGCCGC
ACCGCCGAAA AGCGGGTCAT CGAGCTGTGC GACCGCTTCG GCAAAGATGT CTACCTGGCG
GCTACCCAAG CCCTGCTCGA CCGCACCTAT CGGGCGATGA AGGAGCTGAT CCTGCGCAAC
CTGCCCGAAG AGCCACAGTC GTTTGAAGAT TATGTTGATG ACGACGGGTT GGGCAACGGG
CCGTTCAAGA TGAAGCTGAC GATCTGGCGC GAGGGTCATG AAGCCTACTT TGACTGGACG
GGCACCGACC CGCAGGCGAT GGGGCCGATC AACTTCTACC TCAACGAATC GATGTTCAAA
ATGTTCATCG GCGTCTACCT GATTATGGTT TTTGACCCGC AAATCCTGTT CAACGACGGC
TTCTACCCGC TGCTGCATGT CACTATGCCC AAAGGCAGCC TTATCCAGCC GGAGTTTCCG
GCTGCACTGG GATGCCGCAC CCATGCTCTC ACCCGCCTCT TCGACGTGCT CGGCGGTGCG
CTGGCCAAAC GTGCGCCCGA GTTCACGACT GCTGCCGGCT ATGGCACCTC GCCCTACCTG
CTCTATTCCG GTTACGATAA GGACGGTGAA TTCTTCTATC TGATGGAGAT CAACTACGGT
GGTATTCCCG GTCGTCCGAT TGGCGACGGA ATGGACGGAC ACTCGTGGTG GCCGCTCTTT
ACCAACATTC CCACCGAATA CCTTGAGAGC TACTATCCGA TCCGGATCGA ACGCTACACC
AGTATTATGG ACTCAGGTGG CGCCGGCTTC CACCGCGGCG GCAACGGCAT CGAGAAGATC
TATACCTTCC TCGAACCGGG AGAAATCTCG ATCCACGATG ACCGCTGGCT GACGCCGCCG
TGGGGCAACG TCGGCGGTAA GCCGGGTAGT CGGTCAACGA AGATTCTCGT GCGTACCGAT
GGCACACGCA AGGTACTGCC CAGCAAGTGC GACCAAATTG CGGTCGAGCC AGGTGATCAA
TTGATCTACC GCACTGCCGG TGGCGGCGGT TGGAAAGACC CGCTTACCCG CCCACCGGAG
GCCGTGCAGC GTGATGTGCG CTACGGGTTG GTCAGCCGCG AGAAGGCCCT GAATGACTAT
GGCGTCGTCC TGACCGACAC GTTAGACATC GACCTGGCTG CTACCGAAGC CAAACGCGCC
GAGCTGGCTG CTGCCCGTGG CGAAATCAAA GGCTTCGACT TCGGACCGCC GCTGGAAGAG
TTGCTGACCA ATGCCGAAGC CGAAACCGGC TTGCCGGCAC CGCGTAAACC GCAACCGGTG
CGATGGGCGA TGGCCCGCGC TGCCCGTCGT GAGCGTATGG CGCAACCTGT CAAGCAACAC
GAAGTCACAG CCGACTAG
 
Protein sequence
MATNLKIASI PQIEIDPVTL DIIENALKNT RYEMDAVLYR TAMSPVIREQ HDQFPMITDP 
HGRMIVGQFG SYIAGLLDHW DQTIEPGDVI LLSDPYLCGG AISHINDLLV MLPIFYGEGE
DRELIGWASM FGHAQDVGGP LPGSLPTNAT TIFGEGLRLP PVKIYERGKL NRAVLDIMLN
NVRQPEMNRA DLMAIIASCR TAEKRVIELC DRFGKDVYLA ATQALLDRTY RAMKELILRN
LPEEPQSFED YVDDDGLGNG PFKMKLTIWR EGHEAYFDWT GTDPQAMGPI NFYLNESMFK
MFIGVYLIMV FDPQILFNDG FYPLLHVTMP KGSLIQPEFP AALGCRTHAL TRLFDVLGGA
LAKRAPEFTT AAGYGTSPYL LYSGYDKDGE FFYLMEINYG GIPGRPIGDG MDGHSWWPLF
TNIPTEYLES YYPIRIERYT SIMDSGGAGF HRGGNGIEKI YTFLEPGEIS IHDDRWLTPP
WGNVGGKPGS RSTKILVRTD GTRKVLPSKC DQIAVEPGDQ LIYRTAGGGG WKDPLTRPPE
AVQRDVRYGL VSREKALNDY GVVLTDTLDI DLAATEAKRA ELAAARGEIK GFDFGPPLEE
LLTNAEAETG LPAPRKPQPV RWAMARAARR ERMAQPVKQH EVTAD
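
The sequences above are printed in blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own tooling, of how the listing could be processed: it strips the block spacing, computes the GC fraction, and translates the gene. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence listed above.
head = clean_sequence("ATGGCAACCA ACCTCAAGAT TGCGTCGATT")
tail = clean_sequence("GAAGTCACAG CCGACTAG")
print(translate(head))  # prints: MATNLKIASI
print(translate(tail))  # prints: EVTAD
```

Passing the full cleaned gene sequence to `translate` and `gc_fraction` gives values that can be compared with the listed protein sequence and the reported GC content.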