Gene Haur_1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1195 
Symbol 
ID5733088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1377217 
End bp1378281 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content53% 
IMG OID641278335 
Productmetalloendopeptidase glycoprotease family 
Protein accessionYP_001543971 
Protein GI159897724 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAGC AAATGCCCGC AGGTTTTACG ATTTTAGCAA TTGAAACATC GTGCGATGAA 
ACCGCCGCCG CCGTTATTCG CGATGGCCGC GAGATTGTGG CCAATAGCGT CGCTTCACAA
ATCGATATTC ATCAACGCTA TGGTGGCGTT GTGCCCGAAG TGGCTTCGCG CCAACATATT
CTGACGATTA CACCCGTGAT TAATACGGTG CTGGCCCAAG TGCCCGGCGG TTGGAGCGCG
ATTAATGCGG TTGCCACAAC TTATGGACCA GGCTTAGCTG GCGCTTTATT AACTGGAATT
AACACCGCCA AGGCGATTGC GTGGTCGCGC AATTTGCCGT TTATCGGGGT TAATCACCTC
GAAGGCCATA TTTATGCTTC GTGGCTGCAT ACTGCCAAAG ATCTAGCCTA CCAAGCCCCC
GAATTTCCGT GTGTCGCCTT GATTGTTTCT GGTGGCCACA CGGCCTTGGT GTTGCTCAAC
GATCATGGTG ATTATCGTTT GCTTGGGCAA ACCCGCGATG ATGCAGTGGG CGAGGCTTTC
GATAAGGTAG CGCGAATTAT GGGCTTGGGC TATCCAGGCG GCCCGCAAAT GGAAAAAGTC
GCGCGTGGGG TTAATCCAGG TGCACTCAAA TTGCCCCGAG CATGGTTGCG CGGAACCTAC
GATTGGAGCT TTAGCGGCCT CAAAACCGCT GTGCTGAACG TGGTCAATGA TCGTTTGGGT
GAGCGCATGG AGAAAGTTAA ACTTGCTGAA GTTGACCCAG CCTTCACCGC CCAGCTTGCC
GCTGCCTTCC AAGATTCGGC AGTCGATGTG TTGGTGCAAA AAACCGTGGC TGCTGCTCGC
GAATACAAAG CCAAAACGAT TATTTTAGCT GGTGGCGTAG CCGCAAATAC TGCCTTACGC
GAGCGCTTGA GCGCAACCGC CAAGCCTATT CCGGTAGCCT ACCCGCCAAT TTGGCTGTGT
ACCGATAATG CCGCCATGAT CGGCGCTGCC GCCTATTATC GTTATCAAGC AGGCGTGCAG
CAGGATTGGT CGCTCGATGC CACTCCCAGT CTCAAGCTGA TTTAA
 
Protein sequence
MSQQMPAGFT ILAIETSCDE TAAAVIRDGR EIVANSVASQ IDIHQRYGGV VPEVASRQHI 
LTITPVINTV LAQVPGGWSA INAVATTYGP GLAGALLTGI NTAKAIAWSR NLPFIGVNHL
EGHIYASWLH TAKDLAYQAP EFPCVALIVS GGHTALVLLN DHGDYRLLGQ TRDDAVGEAF
DKVARIMGLG YPGGPQMEKV ARGVNPGALK LPRAWLRGTY DWSFSGLKTA VLNVVNDRLG
ERMEKVKLAE VDPAFTAQLA AAFQDSAVDV LVQKTVAAAR EYKAKTIILA GGVAANTALR
ERLSATAKPI PVAYPPIWLC TDNAAMIGAA AYYRYQAGVQ QDWSLDATPS LKLI