Gene Cagg_1902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1902 
Symbol 
ID7266393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2333616 
End bp2334668 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content57% 
IMG OID643566739 
Productsortase family protein 
Protein accessionYP_002463233 
Protein GI219848800 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3764] Sortase (surface protein transpeptidase) 
TIGRFAM ID[TIGR01076] LPXTG-site transpeptidase (sortase) family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0102426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.943943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGAT TACTTATCCT TTGTCTGATA GTATTGCTTC TTAGCCCGTT ACCGGTCGCT 
GCCAATACGG CTGCCGGTCA ACCGACCGTC TTCCGTGAGA CCGGTCATAC GTTGGCATAT
GCCTTTCGTG AATTCTACGA CCGGCAAGGT GGTCTACCGA TTTTTGGCTA TCCACTCACC
GAGGTGTTTA TTGAAGATGG CCGTCCGGTG CAGTATTTCG AGCGTGCCCG CTTCGAGTGG
CACGCCGATT TGGCGTTGGT GCAGGTCGGG CATCTTGGGC GATGGGCGGC AACGGCGTAT
GTCGATCATC CGGCGTTTGT ACCATTACCG ACAGCTCCGG CAAATGCCGA TTTCTTTCCC
GAAACCGGTC ATAGTCTGAG TGGGGCTTTT CGTACTTTCT GGTGGCAAAA CGGTGGGTTG
CCGACGTTCG GTTTCCCGCT ATCAGAACCG TTTGAGACCG TCGATGAGAA TGGTCAGCCG
CGTGTGGTCC AGTTCTTTGA GCGGGCACGC TTTGAGTGGC ATCCGCAGAA CCCACCCCGC
TACCAGGTGC TGCTCGGACA TTTGGGACGG GCATGGTTGG CCGCACATCC GGTGCCGGAA
TGGTCACTAC AACCGGTGAC AAGCGGTGAT GCTGCGTGGG CTGCGGTTCG TCCGACGCGC
GTGCGGGTAC CCCGCATCGG TGTCGATACC GAGGTTGTCA GTGCGGGATT TTCGTTTGGG
GTGTGGGACG TTCCACGCTA TACGGCCGTC CACTACTGGC CGATCAGTGG CTATCCCGGC
ACAACCGGCA ATATCGTGAT TGCCGGCCAT GTTGGGTATC GTGGAATTAT TTTCAATCAG
CTACCGGCAA TTACGGTCGG CGATGAAGTA TTGGTCACGG TGAACGGTAA CGACCGTCGC
TATGTGGTGC GTGAGGTTTT GACGGTGCTT CCCGATGCGA CGTGGGTGCT CGCACCGACA
AGCAGCGAAA CGCTCACGCT GATTACCTGT GTACCGATTG GAGTCTATTC CCATCGGCTC
ATTGTGCGTG CCACACCGGT GACCGATTCG TAA
 
Protein sequence
MRRLLILCLI VLLLSPLPVA ANTAAGQPTV FRETGHTLAY AFREFYDRQG GLPIFGYPLT 
EVFIEDGRPV QYFERARFEW HADLALVQVG HLGRWAATAY VDHPAFVPLP TAPANADFFP
ETGHSLSGAF RTFWWQNGGL PTFGFPLSEP FETVDENGQP RVVQFFERAR FEWHPQNPPR
YQVLLGHLGR AWLAAHPVPE WSLQPVTSGD AAWAAVRPTR VRVPRIGVDT EVVSAGFSFG
VWDVPRYTAV HYWPISGYPG TTGNIVIAGH VGYRGIIFNQ LPAITVGDEV LVTVNGNDRR
YVVREVLTVL PDATWVLAPT SSETLTLITC VPIGVYSHRL IVRATPVTDS