Gene Tgr7_3016 details


Gene Information

Locus tag: Tgr7_3016
Symbol:
ID: 7318313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. HL-EbGR7
Kingdom: Bacteria
Replicon accession: NC_011901
Strand:
Start bp: 3156406
End bp: 3157416
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 72%
IMG OID: 643617914
Product: O-sialoglycoprotein endopeptidase
Protein accession: YP_002515073
Protein GI: 220936174
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
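
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3156406
end_bp = 3157416
gene_length_bp = 1011
protein_length_aa = 336


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the reported gene length (1011 bp) and protein length (336 aa) agree with the start and end coordinates.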


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.612396
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCATCC TGGGTATCGA GACCTCCTGT GATGAGACCG GCGTGGCGGT GATCGATGCC 
GAGCGCGGGC TGCTGTCCCA TGCCCTCTAC AGCCAGGTGG CCCTGCACGC GGAATACGGC
GGCGTCGTGC CGGAGCTGGC CTCCCGGGAC CATATCCGCA AGCTGCTGCC CCTGGTGCGC
CAGGCCCTGG ACGGGGCGGA CACCGCCGCC TCGGATCTCA CCGGCATCGC CTATACCTCG
GGCCCCGGCC TGCTGGGTGC GCTGCTGGTG GGTGCCGGGG TGGCGCGCAG TCTCGCCTGG
GCCTGGGGCC TGCCCGCCGT GGGCGTGCAC CACATGGAGG GGCACCTGCT CGCGCCCCTG
CTGGGCGATG AACCGCCCGA GTTTCCCTTC CTGGCCCTGC TGGTCTCCGG CGGGCATACC
ATGCTGGTGG ACGTGCAGGG GGTGGGGCGT TACCGGATCC TGGGCGAGAG CCTGGACGAC
GCCGCCGGCG AGGCCTTCGA CAAGACCGCC AAGCTGCTCG GGCTGAGCTA CCCCGGCGGC
CCGGCCCTGG CGGCCCTGGC GGAACGGGGT GATCCGGCGC GTTTCAGCTT TCCCCGCCCC
ATGACCGACC GCCCGGGCCT GGACTTCAGT TTCTCCGGGC TCAAGACCCG GGCCCTGACC
ACCCTGCGCG AGACGCGCAC CGAGCAGGAC CGGGCCGACG TGGCGCGGGC CTTCGAGGAG
GCGGTGGTGG ATACCCTGGT GATCAAGTGT CTGCGGGCGG TGCAGGAGAC CGGGGCTGAG
CGCCTGGTGG TGGCCGGTGG CGTGGGCGCC AACCGCCGCC TGCGGCAGCG GCTCAAGGAA
GCGGTGGGCG CCGAGGGCGC CAGCGTGCAC TACCCGCCAT TCGAGTTCTG CACCGACAAC
GGCGCCATGA TTGCCCTGGC GGGGTTGATG CGTTTCCAGG CGGGTGCGGG CGAGGACCTG
ACCATCCGGG CGCGCGCCCG GTGGAACCTG GAGTCGAAGT CGGAAGTGTG A
 
Protein sequence
MRILGIETSC DETGVAVIDA ERGLLSHALY SQVALHAEYG GVVPELASRD HIRKLLPLVR 
QALDGADTAA SDLTGIAYTS GPGLLGALLV GAGVARSLAW AWGLPAVGVH HMEGHLLAPL
LGDEPPEFPF LALLVSGGHT MLVDVQGVGR YRILGESLDD AAGEAFDKTA KLLGLSYPGG
PALAALAERG DPARFSFPRP MTDRPGLDFS FSGLKTRALT TLRETRTEQD RADVARAFEE
AVVDTLVIKC LRAVQETGAE RLVVAGGVGA NRRLRQRLKE AVGAEGASVH YPPFEFCTDN
GAMIALAGLM RFQAGAGEDL TIRARARWNL ESKSEV
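
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases of the gene. This is a minimal sketch, not the method used to produce the record: it builds the codon table by hand rather than using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used in the record layout
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record with its spacing.
first_line = "ATGCGCATCC TGGGTATCGA GACCTCCTGT GATGAGACCG GCGTGGCGGT GATCGATGCC"
print(translate(first_line))  # MRILGIETSCDETGVAVIDA
```

The output matches the first 20 residues of the protein sequence listed above (MRILGIETSC DETGVAVIDA).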