Gene Cagg_0551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0551 
Symbol 
ID7267048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp670481 
End bp672517 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content55% 
IMG OID643565414 
ProductOligopeptidase B 
Protein accessionYP_002461926 
Protein GI219847493 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTG AGGCTGTCCG TCCGCCACAT GCCGAACGCA AACCGGTGAT ACTTCGGCTT 
CACGGCGATG AGATCGTTGA TGAGTTTGCG TGGCTGGAGA ATCGTGATGA TCCGGCAGTC
ATTGCCTATC TCGAAGCCGA AAATACCTAT GCCGAAGCAG TGATGGCACC GGCTGCACCG
CTGCGTGAAC AGCTCTACGC TGAAATGCGT GGCCGGATCA AAGAGGAGGA TCGGTCGGTG
GCGGTGCCGC GGGGACGGTA TCGTTACTAC TCGCGGACTG AGGCCAGTGC TGAGTATCCG
GTGATGTGTC GTACCGAGGG CGACGATGGA CTGGAAGAGG TGGTACTTGA TCTGAATACA
TTAGCAATCG GGCATGCGTT TTGTCAGTTA GGAGCTTATG AGCCATCACC GAACCAGCAC
TTGCTGGCTT ATGGCTTAGA TACCACCGGT TCAATTATCT TTACGCTGTT CATCAAAGAT
CTCACGACCG GTGCCTTACT CGACACGCCT ATCGAGCGGG TGAATGATGT GCAGTGGGCC
GATGACCGGA CCCTCTTCTA CACCGTCTTC GATGATGCTC ACCGGGCGTA TCGCCTCTAC
CGTCATGTAC TGGGGACCTC ACCTGCCGAC GATGCGTTGA TCTATGAAGA GACCGATGAA
CGGTTTAGTC TGAGCCTACG CCGTACCCGT TCGGGAGCCT ATCTCCTCCT TACGAGTTAT
AGTCACGGTG GGACAGAAGT ACACTATGTT TCAACTGCCA CCCCATTTGC CGACTGGCAG
GTGATCTATC CGCGACGGCC CAAGATCGAC TACTTTGTCG ATCATCACGG CGATTACTTC
TACATCCGCA CGAATGACGG TGCCGAGAAT TTTCGCCTGA TCCGCGCACC GATCTCCGAT
CCGACGGCAA TGATCGAACT CGTGCCGGGT CGGGTTGATG TGCTGATCGA CCATTTCGAT
TGTTTCGCCG ACTATTTGGT GGTCTATGAG CGGCGCGATG GATTACGCCA GATTCGGATC
AGCACTCCCG ATGGTGATCA GGTGCGTTAC GTTTCGTTCC CCGAACCGGT CTACACGTGT
GGACCGCATG AAAACAAAGA GTTCGCAACT GACCGGTTAC GCCTGAGCTA CAGTTCACTC
ATCACACCGC CGTCGGTGGT TGAATACAAT ATGCGCACCG GCTCATGGCA GGTGGTGAAG
CAGGAGGAGA TTCCGTCTGG CTACGATCCA TCGCGCTACG TTAGCGAACG GCTCACTGCG
ACGGCGCCAG ATGGAGCACG GGTGCCTATT TCACTCGTTT ACCGGCGTGA TCGACCGCGT
AACGGTGGGC CTTGTCTTTT GGTTGGGTAT GGCTCGTATG GCTACAGTTA TGAGCCATCA
TTCGATAGTA AGCGCCTCAG CCTTCTCGAT CGAGGCTTTG TTGTGGCAAT TGCCCATATT
CGCGGCGGTC AAGAACTAGG GCGACGGTGG TATGAGCAGG GGCGTATGCT GCATAAGCCC
AATACGTTCA GTGACTTTAT TGCCTGCGCC GAACACCTGA TCGCTGCCGG ATACACTTCA
CCTCGTCAAT TGGCGATTAG TGGGCGGAGT GCCGGTGGTT TGCTGATGGC TGCCGTCGTT
AATGCTCGTC CCGATCTCTT TCAGGCGGTG GTCGCCGGGG TACCGTTTAC CAACGTGATT
ATCGCGATGC TCAAACCCGA TCTGCCGCTC ACCGTCACCG AATGGGAACA GTGGGGTAAT
CCGGCTATCG AAGCTGAATA TCGGGTGATG CGTTCATACG ATCCCTATCT GAACGTGAAG
CCGGGTCCGT ACCCGCACAT TCTGGCGACT GCCGGTCTCC ACGATTTGCA AGTGCCGTAC
TGGGATCCGG CCAAATGGGT GGCTAAGCTG CGTACTGTTA AAACTAATGA TACGATGTTA
CTGTTGCGCA CCAATATGCA GGCCGGTCAT AGTGGCCATT CTGGGCGCTT TGCCCGCCTC
ACCGAGTTTG CGTGGGAGTA TGCTTTTATC TTGACTGCCT TGGGAATTGC GTCGTAG
 
Protein sequence
MSVEAVRPPH AERKPVILRL HGDEIVDEFA WLENRDDPAV IAYLEAENTY AEAVMAPAAP 
LREQLYAEMR GRIKEEDRSV AVPRGRYRYY SRTEASAEYP VMCRTEGDDG LEEVVLDLNT
LAIGHAFCQL GAYEPSPNQH LLAYGLDTTG SIIFTLFIKD LTTGALLDTP IERVNDVQWA
DDRTLFYTVF DDAHRAYRLY RHVLGTSPAD DALIYEETDE RFSLSLRRTR SGAYLLLTSY
SHGGTEVHYV STATPFADWQ VIYPRRPKID YFVDHHGDYF YIRTNDGAEN FRLIRAPISD
PTAMIELVPG RVDVLIDHFD CFADYLVVYE RRDGLRQIRI STPDGDQVRY VSFPEPVYTC
GPHENKEFAT DRLRLSYSSL ITPPSVVEYN MRTGSWQVVK QEEIPSGYDP SRYVSERLTA
TAPDGARVPI SLVYRRDRPR NGGPCLLVGY GSYGYSYEPS FDSKRLSLLD RGFVVAIAHI
RGGQELGRRW YEQGRMLHKP NTFSDFIACA EHLIAAGYTS PRQLAISGRS AGGLLMAAVV
NARPDLFQAV VAGVPFTNVI IAMLKPDLPL TVTEWEQWGN PAIEAEYRVM RSYDPYLNVK
PGPYPHILAT AGLHDLQVPY WDPAKWVAKL RTVKTNDTML LLRTNMQAGH SGHSGRFARL
TEFAWEYAFI LTALGIAS